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ALIGNMENTS 


RESULT 1 
ABG32540 

ID ABG32540 standard; protein; 157 AA. 
XX 

AC ABG32540; 
XX 

DT 15-NOV-2002 (first entry) 
XX 

DE Human CCR5-based scaffolded fusion protein #2. 
XX 

KW Scaffolded protein; CCR5; HIV; human immunodeficiency virus infection; 

KW ECD; extracellular domain; metal chelating motif; zinc finger protein; 

KW integral membrane protein; soluble loop; intracellular domain; ICD; 

KW gene therapy; immunogen; viral infection; human. 
XX 

OS Homo sapiens. 


OS Synthetic. 
XX 

FH Key Location/Qualifiers 

FT Peptide 1. .19 

FT /label= Signal_peptide 

FT Protein 20. .157 

FT /label= Mature_scaf f olded_protein 

XX 

PN WO200260477-A1. 
XX 

PD 08-AUG-2002. 
XX 

PF 29-JAN-2002; 2002WO-US002377 . 
XX 

PR 31-JAN-2001; 2 001US-02657 82P . 

PR 31-JAN-2001; 2 001US-0265858P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Coleman TA, Mansfield B; 
XX 

DR WPI; 2002-643357/69. 
XX 

PT Novel scaffolded fusion polypeptide useful for therapeutic purposes or 

PT for screening molecules that bind/activate/inhibit/modulate the 

PT polypeptide, comprises a functional polypeptide domain fused to a 

PT scaffold domain. 

XX 

PS Example 2; Page 41; 64pp; English. 
XX 

CC The invention relates to a scaffolded fusion polypeptide comprising a 

CC functional polypeptide domain fused to a scaffold domain, where the 

CC functional polypeptide domain corresponds to a soluble loop of an 

CC integral membrane protein (e.g. human CCR5, a transmembrane receptor 

CC involved in HIV (human immunodeficiency virus) infection) . Also included 

CC are; (1) a polypeptide comprising a scaffold domain; (2) a nucleic acid 

CC encoding the fusion polypeptide; (3) a vector cassette for the expression 

CC of the fusion polypeptide comprising an expression region operably linked 

CC to a promoter, where the expression region comprises a number of 

CC cassettes, each of which encodes a module, domain or strand of the fusion 

CC polypeptide and (4) a host cell comprising the vector or nucleic acid. 

CC The fusion polypeptide is useful for screening molecules that 

CC bind/activate/inhibit/modulate the fusion polypeptide, by expressing the 

CC fusion polypeptide from and identifying a molecule that binds to the 

CC fusion polypeptide. The fusion polypeptide is useful in diagnostic 

CC methods, in assays to identify compounds that interact with loops of 

CC fragments of an extracellular domain (ECD) or an intracellular domain 

CC (ICD) or to rapidly assay the function of mutated portions of mutant 

CC integral membrane proteins without having to produce significant 

CC quantities of the entire mutant integral membrane protein, to generate 

CC antibodies that recognise the integral membrane proteins from which they 

CC are designed, to competitively bind the ligand of a naturally occurring 

CC receptor in vitro or in vivo, to display and/or screen soluble domains 

CC from protein such as integral membrane proteins, to probe the structure 

CC of ECD or ICD, or both, of an integral protein membrane, to modulate the 

CC activity of a receptor in vivo, and for treating or preventing viral 

CC infection, preferably human HIV infection e.g. by gene therapy using the 


CC encoding nucleic acid. The present sequence is a scaffolded protein based 

CC on the ECD region of human CCR5 (not defined) 

XX 

SQ Sequence 157 AA; 

Query Match 100.0%; Score 884; DB 5; Length 157; 

Best Local Similarity 100.0%; Pred. No. 5.1e-84; 

Matches 157; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MKVSVAALSCLMLVTALGSMDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQW 60 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I II I I I 
Db 1 MKVSVAALS CLMLVTALGSMDYQVS S PI YDINYYT SEPCQKINVKQI AAYKCGLCAAAQW 60 

Qy 61 DFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQR 12 0 

I I I I II I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 

Db 61 DFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQR 12 0 

Qy 121 VHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVHAA 157 

I I I I I I I I I I I 1 i II I I I I I 1 I I I I I I 1 I I I I I I I 1 I 
Db 121 VHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVHAA 157 


RESULT 2 

ABG32539 

ID 

ABG32539 standard; protein; 138 AA. 

XX 


AC 

ABG32539; 

XX 


DT 

15-NOV-2002 (first entry) 

XX 


DE 

Human CCR5-based scaffolded fusion protein #1. 

XX 


KW 

Scaffolded protein; CCR5; HIV; human immunodeficiency virus infection; 

KW 

ECD; extracellular domain; metal chelating motif; zinc finger protein; 

KW 

integral membrane protein; soluble loop; intracellular domain; ICD; 

KW 

gene therapy; immunogen; viral infection; human. 

XX 


OS 

Homo sapiens. 

OS 

Synthetic. 

XX 


PN 

WO200260477-A1. 

XX 


PD 

08-AUG-2002. 

XX 


PF 

29-JAN-2002; 2002WO-US002377 . 

XX 


PR 

31-JAN-2001; 2001US-0265782P. 

PR 

31-JAN-2001; 2001US-0265858P . 

XX 


PA 

(HUMA-) HUMAN GENOME SCI INC. 

XX 


PI 

Coleman TA, Mansfield B; 

XX 


DR 

WPI; 2002-643357/69. 

XX 


PT 

Novel scaffolded fusion polypeptide useful for therapeutic purposes or 

PT 

for screening molecules that bind/activate/inhibit/modulate the 


PT polypeptide, comprises a functional polypeptide domain fused to a 

PT scaffold domain. 

XX 

PS Example 1; Page 21; 64pp; English. 
XX 

CC The invention relates to a scaffolded fusion polypeptide comprising a 

CC functional polypeptide domain fused to a scaffold domain, where the 

CC functional polypeptide domain corresponds to a soluble loop of an 

CC integral membrane protein (e.g. human CCR5, a transmembrane receptor 

CC involved in HIV (human immunodeficiency virus) infection) . Also included 

CC are; (1) a polypeptide comprising a scaffold domain; (2) a nucleic acid 

CC encoding the fusion polypeptide; (3) a vector cassette for the expression 

CC of the fusion polypeptide comprising an expression region operably linked 

CC to a promoter, where the expression region comprises a number of 

CC cassettes, each of which encodes a module, domain or strand of the fusion 

CC polypeptide and (4) a host cell comprising the vector or nucleic acid. 

CC The fusion polypeptide is useful for screening molecules that 

CC bind/activate/inhibit/modulate the fusion polypeptide, by expressing the 

CC fusion polypeptide from and identifying a molecule that binds to the 

CC fusion polypeptide. The fusion polypeptide is useful in diagnostic 

CC methods, in assays to identify compounds that interact with loops of 

CC fragments of an extracellular domain (ECD) or an intracellular domain 

CC (ICD) or to rapidly assay the function of mutated portions of mutant 

CC integral membrane proteins without having to produce significant 

CC quantities of the entire mutant integral membrane protein, to generate 

CC antibodies that recognise the integral membrane proteins from which they 

CC are designed, to competitively bind the ligand of a naturally occurring 

CC receptor in vitro or in vivo, to display and/or screen soluble domains 

CC from protein such as integral membrane proteins, to probe the structure 

CC of ECD or ICD, or both, of an integral protein membrane, to modulate the 

CC activity of a receptor in vivo, and for treating or preventing viral 

CC infection, preferably human HIV infection e.g. by gene therapy using the 

CC encoding nucleic acid. The present sequence is a scaffolded protein based 

CC on the ECD region of human CCR5 (not defined) 

XX 

SQ Sequence 138 AA; 

Query Match 90.2%; Score 797; DB 5; Length 138; 

Best Local Similarity 100.0%; Pred. No. 5.1e-75; 

Matches 138; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 7 9 


Db 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60 


Qy 


8 0 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 139 


Db 


61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 120 


Qy 


14 0 NNCSSSNRLDGHQRVHAA 157 


Db 


121 NNCSSSNRLDGHQRVHAA 138 


RESULT 3 
AAY41280 
ID AAY41280 


standard; protein; 439 AA. 


XX 

AC AAY412 80; 
XX 

DT ll-FEB-2000 (first entry) 
XX 

DE Fusion protein containing human chemokine receptor CCR-5. 
XX 

KW Prostaglandin; PG; E2EP3 receptor; E2EP2 receptor; CCR-5; human; 

KW chemokine receptor; ss2 adrenergic receptor; small G-protein rho; 

KW renal outer medullary potassium ion channel protein; ion-channel protein; 

KW lambda phage repressor protein; G-protein coupled receptor; bacteria; 

KW biochemical; vaccine; inununohis to chemical ; orphan receptor; HIV. 

XX 

OS Synthetic. 

OS Homo sapiens. 
XX 

PN WO9953033-A1. 
XX 

PD 21-OCT-1999. 
XX 

PF 16-APR-1999; 99WO-US008214 . 
XX 

PR 16-APR-1998; 98US-008 1989P . 
XX 

PA (UYVA-) UNIV VANDERBILT. 
XX 

PI Breyer RM, Ma L, Kennedy C; 
XX 

DR WPI; 1999-620416/53. 

DR N-PSDB; AAZ24738. 
XX 

PT New nucleic acid constructs for high level expression of eukaryotic 

PT proteins in bacteria, for producing e.g. chemokine receptor CCR-5 for 

PT preventing HIV infection. 
XX 

PS Disclosure; Page 49-53; 81pp; English. 
XX 

CC The invention provides isolated nucleic acid sequences that encode rabbit 

CC prostaglandin (PG) E2EP3 receptor, human PG E2EP2 receptor, human 

CC chemokine receptor CCR-5, human ss2 adrenergic receptor, rat renal outer 

CC medullary potassium ion channel protein or human small G-protein rho, 

CC together with deduced protein sequences. Also provided is a method for 

CC the production of eukaryotic proteins by culturing bacteria transformed 

CC with vectors containing the above nucleic acid sequences or a nucleic 

CC acid (I) that comprises: (i) first sequence that encodes either a 

CC sequence comprising at least three positively charged amino acids, or a 

CC DNA-binding protein, or a lambda phage repressor protein, placed upstream 

CC of, and in frame with, (ii) a sequence encoding a protein. (I) are used 

CC for recombinant production of eukaryotic proteins, particularly membrane 

CC proteins, G-protein coupled receptors or ion-channel proteins, in 

CC bacteria. These proteins are useful for biochemical or structural studies 

CC ; as therapeutic agents; in diagnostic and screening assays and as 

CC antigens for use in vaccines, and for raising antibodies that are useful 

CC as immunohistochemical markers, e.g. for orphan receptors or ion 

CC channels. Antibodies raised against the chemokine receptor CCR-5 can be 

CC used (when administered as antiserum or generated in vivo) to prevent 

CC entry of human immune deficiency virus (HIV) into cells 


XX 

SQ Sequence 439 AA; 


Query Match 43.0%; Score 380; DB 2; Length 439; 

Best Local Similarity 34.1%; Pred. No. 5.9e-31; 

Matches 103; Conservative 8; Mismatches 31; Indels 160; Gaps 8; 

Qy 1 MKVSVAALSCLMLVTALGSMDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

: II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 7 0 LKVSVEEF HMDYQVSSPI YDINYYTSEPCQKINVKQIAARLLPPLYSLVF 119 

Qy 50 YKCGLC AAAQWDFGNTMC 67 

I I I I I I I I I I I I I I 

Db 120 I FGFVGNMLVI LI LINCKRLKSMTDI YLLNLAI S DLFFLLTVP FWAHYAAAQWDFGNTMC 179 

Qy 68 QHQRVHGHHHHSYKCG 83 

I : I : : I 

Db 180 — QLLTGL YFI GFFS GI FFI I LLT I DRYLAVVHAVFALKARTVT FGWT S VI TWWAVFA 237 

Qy 84 LCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKI 117 

: I I I I 1 I I I I I I M I I I I I I I I I I I I I II I I I I 
Db 238 SLPGIIFTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILK 2 97 

Qy 118 HQRVHGG GGSYKCGLC QEFFGLNNCSSSNRLDGHQ 152 

: : I I I I I I I I II I I I I II I I I 

Db 2 98 TLLRCRNEKKRHRAVRLIFTIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAM 357 

Qy 153 RV 154 

: I 

Db 358 QV 359 


RESULT 4 
AAW27407 

ID AAW27407 standard; protein; 352 AA. 
XX 

AC AAW27407; 
XX 

DT 14-APR-1998 (first entry) 
XX 

DE Human CCR5 . 
XX 

KW Human Cys-Cys chemokine receptor 5; CCR5; human immunodeficiency virus; 

KW type 1; type 2; HIV-1; HIV-2; diagnosis; treatment; prevention; 

KW inflammatory disease; rheumatoid arthritis; glomerulonephritis; asthma; 

KW idiopathic pulmonary fibrosis; psoriasis; viral infection; cancer; 

KW atherosclerosis; autoimmune disorder. 

XX 

OS Homo sapiens. 
XX 

PN WO9732019-A2. 
XX 

PD 04-SEP-1997. 
XX 

PF 28-FEB-1997; 97WO-BE000023 . 
XX 

PR 01-MAR-1996; 96EP-00870021 . 


PR 06-AUG-1996; 96EP-00870102 . 
XX 

PA (EURO-) EUROSCREEN SA. 
XX 

PI Samson M, Parmentier M, Vassart G, Libert F; 
XX 

DR WPI; 1997-479829/44. 

DR N-PSDB; AAT90117. 
XX 

PT Active and inactive forms of human CC chemokine receptor CCR-5 - useful 

PT to diagnose, prevent and/or treat inflammatory disorders, autoimmune 

PT disease and viral infection. 
XX 

PS Claim 4; Fig lb-c; 94pp; English. 
XX 

CC The present sequence is human CC (Cys-Cys) chemokine receptor 5 (CCR5) , 

CC which is stimulated by MIP-1 alpha, MIP-1 beta or RANTES chemokines, but 

CC not by monocyte chemoattractant protein 1 (MCP-1), MCP-2, MCP-3, 

CC interleukin-8 (IL-8) or growth related gene product alpha (GRO alpha) 

CC chemokines. Active CCR-5 is also a receptor of human immunodeficiency 

CC virus type 1 or type 2 (HIV-1 or HIV-2). CCR5 or its cDNA can used to 

CC diagnose, treat and/or prevent inflammatory diseases, e.g. rheumatoid 

CC arthritis, glomerulonephritis, asthma, idiopathic pulmonary fibrosis and 

CC psoriasis, viral infections, especially HIV-1 or HIV-2 infection, cancer, 

CC atherosclerosis and autoimmune disorders 
XX 

SQ Sequence 352 AA; 

Query Match 42.9%; Score 37 9; DB 2; Length 352; 

Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I II I I I I I I I I I I I I I I I I I II 

MD YQVS S P I YD I N Y YT SEP CQKI NVKQ I AARLL P P L Y S LVFI FGFVGNML VI L I L I N C KR 60 

YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 

LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

1 1 LLT I DRYLAWHAVFALKARTVT FGWT S VI TWWAVFAS LPGI I FT RSQKEGLHYTC 17 8 

SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I II I I I I I I I I I II I I I I M : : I 

SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 2 38 


Qy 

20 

Db 

1 

Qy 

50 

Db 

61 

Qy 

84 

Db 

119 

Qy 

98 

Db 

179 

Qy 

125 

Db 

239 


-GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I II I I I I I I I I I I I I I : I 


RESULT 5 
AAW27123 

ID AAW27123 standard; protein; 352 AA. 
XX 


AC 

AAW27123; 


XX 



DT 

14-DEC-1997 (first entry) 

XX 



DE 

Human chemokine 

receptor 88C. 

XX 



KW 

Chemokine receptor 88C; atherosclerosis; rheumatoid arthritis; tumour; 

KW 

asthma; viral infection; AIDS; inflammation; autoimmune disease; therapy 

KW 

diagnosis; leukocyte trafficking; G protein coupled receptor; ligand; 

KW 

modulator; antibody; human. 

XX 



OS 

Homo sapiens . 


XX 



FH 

Key 

Location/Qualifiers 

FT 

Domain 

1. .32 

FT 


/label= Extracellular_domain 

FT 

Domain 

56. .67 

FT 


/label= Intracellular domain 

FT 

Domain 

89. .112 

FT 


/label= Extracellular domain 

FT 

Domain 

125. .145 

FT 


/label= Intracellular domain 

FT 

Domain 

166. .191 

FT 


/label= Extracellular domain 

FT 

Domain 

213. .235 

FT 


/label= Intracellular domain 

FT 

Domain 

259. .280 

FT 


/label= Extracellular domain 

FT 

Domain 

301. .352 

FT 


/label= Intracellular_domain 

XX 



PN 

W09722698-A2. 


XX 



PD 

26-JUN-1997. 


XX 



PF 

20-DEC-1996; 

96WO-US020759. 

XX 



PR 

20-DEC-1995; 

95US-00575967. 

PR 

07-JUN-1996; 

96US-00661393. 

XX 



PA 

(ICOS-) ICOS CORP. 

XX 



PI 

Gray PW, Schweickart VL, Raport CJ; 

XX 



DR 

WPI; 1997-34168 

9/31. 

DR 

N-PSDB; AAT85161. 

XX 



PT 

New nucleic acid encoding chemokine receptors 88-2B and 88C - used to 

PT 

modulate leukocyte trafficking, e.g. for treatment of inflammation, 

PT 

tumours, viral 

infections, auto-immune diseases, etc. 

XX 



PS 

Claim 16; Page 

47-48; 65pp; English. 

XX 



CC 

This polypeptide sequence comprises novel human chemokine receptor 8 8C, < 

CC 

G protein coupled receptor that is involved in leukocyte trafficking. It: 

CC 

amino sequence 

was deduced from a cDNA clone (AAT85161) isolated from a 

CC 

macrophage library. It shows 62% identity to CCCKR1 . Chemokine receptor 


CC 88-2B (see AAW27124) has also been identified. 88C and 88-2B receptors 

CC and their polypeptide fragments can be produced in transformed host 

CC cells. The receptors, peptides comprising one or more of the 

CC extracellular or intracellular domains, and anti-receptor antibodies can 

CC be used to modulate receptor activities, particularly ligand and G 

CC protein binding, and are potentially potentially useful in the treatment 

CC of atheroslerosis, rheumatoid arthritis, tumours, asthma, viral 

CC infection, AIDS, inflammatory conditions, pathological immune response, 

CC abnormal haematopoietic processes etc 

XX 

SQ Sequence 352 AA; 

Query Match 42.9%; Score 379; DB 2; Length 352; 

Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I II I I I I I I I II I I I I I II I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 8 4 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYIAWHAVFALKARTVTFGV^SVITWWAVFASLPGIIFTRSQKEGLHYTC 17 8 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCS S SNRLDGHQRV 154 

I I I I I I I I I I I I I I I 1 I I : I 

Db 239 TIMIVYFLFWAP YNIVLLLNTFQEFFGLNNCSS SNRLDQAMQV 2 81 


RESULT 6 
AAW23835 

ID AAW23835 standard; protein; 352 AA. 
XX 

AC AAW23835; 
XX 

DT 08-JUN-1998 (first entry) 
XX 

DE Human CC chemokine receptor 5 (CCR5) . 
XX 

KW CC chemokine receptor 5; CCR5; G-protein coupled receptor; 

KW human immunodeficiency virus; HIV; CD4 ; AIDS; therapy; transgenic animal. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT Domain 29. .55 

FT /label= I 

FT /note= "transmembrane domain" 

FT Domain 104. .12 6 


FT /label= III 

FT /note= "transmembrane domain" 

FT Region 109. .120 

FT /note= "extracellular loop-1 (Claim 19)" 

FT Domain 143. .171 

FT /label= IV 

FT /note= "transmembrane domain" 

FT Region 187. .210 

FT /note- "extracellular loop-2 (Claim 19)" 

FT Domain 194. .219 

FT /label= V 

FT /note- "transmembrane domain" 

FT Domain 238. .258 

FT /label- VI 

FT /note- "transmembrane domain" 

FT Region 261. .276 

FT /note= "extracellular loop-3 (Claim 19)" 

FT Domain 277. .300 

FT /label- VII 

FT /note- "transmembrane domain" 
XX 

PN W09745543-A2. 
XX 

PD 04-DEC-1997. 
XX 

PF 28-MAY-1997; 97WO-US009586. 
XX 

PR 28-MAY-1996; 96US-0018508P . 
XX 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Combadiere C, Feng Y, Berger EA, Alkhatib G, Murphy PM; 

PI Broder CC, Kennedy PE; 

XX 

DR WPI; 1998-032650/03. 

DR N-PSDB; AAT76920. 
XX 

PT CC chemokine receptor 5 polypeptide - used to inhibit membrane fusion 

PT between HIV and a target cell. 

XX 

PS Claim 68; Fig 1C; 70pp; English. 
XX 

CC This protein sequence comprises of a novel human macrophage-selective CC 

CC chemokine receptor that has been designated CCR5 . The sequence was 

CC deduced from an isolated cDNA clone (see AAT76920) . An Alal27Leu variant 

CC (see W238340 of CCR5 was also identified. The susceptibility of human 

CC macrophages to HIV infection depends on cell surface expression of CD4 

CC and CCR5 . CCR5 is a member of the 7-transmembrane superfamily of G- 

CC protein coupled cell surface molecules. It plays an essential role in the 

CC membrane fusion step of infection by some HIV isolates. The establishment 

CC of stable, non-human cell lines and transgenic mammals having cells that 

CC coexpress human CD4 and CCR5 provides valuable tools for research of HIV 

CC infection. Antibodies that bind to CCR5 , CCR5 variants, and CCR5-binding 

CC agents capable of blocking membrane fusion between HIV and target cells 

CC represent potential anti-HIV therapeutics for macrophage tropic strains 

CC of HIV 

XX 


SQ Sequence 352 AA; 


Query Match 42.9%; Score 37 9; DB 2; Length 352; 

Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVS S PI YDIN YYT S EPCQKINVKQIAA 4 9 

I I I I M I I I I II I I I I I I I I I I I I I I I I I I 

Db 1 MDYQVS S PI YDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVI LI LINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

II II I I I I II I II I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 8 4 LCTRSQKEGLHYTC 97 

: I II I I I I I I I I I 

Db 119 IILLTIDRYIAWHAVFALKARTWFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

II I I II I I I II I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


RESULT 7 
AAW88232 

ID AAW88232 standard; protein; 352 AA. 
XX 

AC AAW88232; 
XX 

DT 15-MAR-1999 (first entry) 
XX 

DE HIV-1 co-receptor CCR5 . 
XX 

KW HIV-1; CCR5; CCR5m303; co-receptor; infection; diagnosis; AIDS; 

KW gene therapy; human. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualif iers 

FT Domain 32. .56 

FT /note= "transmembrane domain 1" 

FT Domain 67. .87 

FT /note= "transmembrane domain 2" 

FT Misc-dif ference 101 

FT /note= "corresponds to TGT (Cys) in wild-type CCR5 , TGA 

FT (Stop) in CCr5m303" 

FT Domain 103. .124 

FT /note= "transmembrane domain 3" 

FT Domain 142. .167 

FT /note= "transmembrane domain 4" 

FT Domain 200. .223 

FT /note= "transmembrane domain 5" 

FT Domain 236. .2 60 


FT 
FT 
FT 


Domain 


/note= "transmembrane domain 6 
275. .301 

/note= "transmembrane domain 7 


XX 

PN W09854317-A1. 
XX 

PD 03-DEC-1998. 
XX 

PF 29-MAY-1998; 98WO-EP003437 . 
XX 

PR 30-MAY-1997; 97US-0048057P . 
XX 

PA (MOND-) FOND MONDIALE RECH & PREVENTION SIDA. 
XX 

PI Beretta A, Quillent C, Arenzana Siesdedos F, Braun J; 
XX 

DR WPI; 1999-059835/05. 

DR N-PSDB; AAV84126. 
XX 

PT New CCR5 variant protein of the HIV-1 co-receptor - useful in developing 

PT resistance of CCR5-expressing cells to HIV-1 infection. 

XX 

PS Disclosure; Page 34-35; 55pp; English. 
XX 

CC This is the amino acid sequence of wild-type human CCR5, which serves as 

CC a co-receptor for infection by macrophage-tropic (M-tropic) strains of 

CC HIV-1. The invention relates to the identification of a CCR5 variant (see 

CC AAW88231), designated CCR5m303, comprising the first two transmembrane 

CC domains of wild-type CCR5, but lacking transmembrane domains 3-7. The 

CC presence of the CCR5m303 variant with the wild type CCR5 allele shows a 

CC positive correlation with resistance to infection with M-tropic HIV-1 

CC strains , and may indicate slower progression of the disease. The 

CC detection of CCR5 variants may be used to identify individuals at lower 

CC risk of infection relative to the general population who, if infected, 

CC may exhibit slower progression to AIDS. Probes and primers (see AAV84127- 

CC 36) are provided for use in diagnostic methods for detecting the presence 

CC of such variants. A method is provided for inhibiting HIV-1 infection of 

CC a cell expressing the CCR5 receptor. This involves introducing a nucleic 

CC acid encoding a CCR5 variant into the cell, thereby reducing the number 

CC of functional CCR5 molecules present on the cell surface 
XX 

SQ Sequence 352 AA; 

Query Match 42.9%; Score 379; DB 2; Length 352; 
Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 



Db 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 



Db 


61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 


Qy 


8 4 LCTRSQKEGLHYTC 97 

: I I I 1 I I I I I I I I 


Db 119 IILLTIDRYLAVVHAVFTVLKARTVTFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I II I I I II I I I I I I I II I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 23 8 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I II : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 

RESULT 8 
AAE07048 

ID AAE07048 standard; protein; 352 AA. 
XX 

AC AAE0704 8; 
XX 

DT 16-OCT-2001 (first entry) 
XX 

DE Human G-protein chemokine receptor ( CCR5 ) HDGNR10 protein #2. 
XX 

KW Human; G-protein chemokine receptor; CCR5; HDGNR10; inflammation; HIV; 

KW human immunodeficiency virus; antimicrobial; vasodilator; vulnerary; 

KW cytostatic; immunosuppressive; nootropic; neuroprotective; gene therapy; 

KW neurodegenerative disorder; Kaposi's sarcoma; autoimmune disease; 

KW rheumatoid arthritis; cancer; breast; ovary; adrenal gland; bone marrow; 

KW gastrointestinal tract; lung; liver; immune disorder; Addison's disease; 

KW haemolytic anaemia; autoimmune thyroiditis; diabetes mellitus; allergy; 

KW multiple sclerosis; ulcerative colitis; Crohn f s disease; wound healing; 

KW cardiovascular disorder; myocardial ischaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO200158916-A2. 
XX 

PD 16-AUG-2001. 
XX 

PF 09-FEB-2001; 2 00 1WO-US004 153 . 
XX 

PR 09-FEB-2000; 2000US-0181258P . 

PR 09-MAR-2000; 2000US-0187999P . 

PR 22-SEP-2000; 2000US-0234336P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Roschke V, Li Y, Ruben SM; 
XX 

DR WPI; 2001-488966/53. 

DR N-PSDB; AAD13299. 
XX 


PT Isolated nucleic acid encoding a human G-protein chemokine receptor 

PT (CCR5) HDGNR10 polypeptide, useful for preventing or treating autoimmune 

PT diseases e.g. rheumatoid arthritis, hyperprolif erative disorders and 

PT neurodegenerative disorders. 

XX 

PS Example 40; Page 504-505; 518pp; English. 


XX 


CC The invention relates to human G-protein chemokine receptor (CCR5) 

CC HDGNR10 polypeptides and polynucleotides. CCR5 HDGNR10 antibodies are 

CC useful for treating, preventing or ameliorating a disease or disorder 

CC associated with inflammation, defective or aberrant chemotaxis of immune 

CC cells, HIV infection (such as Pneumocystis carinii pneumonia or Kaposi's 

CC sarcoma) or defective or aberrant T-cell antigen presenting cell 

CC interaction. The disease or disorder may also be an infectious disease 

CC (e.g. a viral infection such as an early stage HIV infection, a 

CC cytomegalovirus infection, or a poxvirus infection) , an autoimmune 

CC disease (e.g. rheumatoid arthritis) or a neurodegenerative disorder. The 

CC disease or disorder may be associated with aberrant CCR5 expression, lack 

CC of CCR5 function, aberrant CCR5 ligand expression, or lack of CCR5 ligand 

CC function. CCR5 HDGNR10 protein is used as a food additive or preservative 

CC to Increase or decrease storage capabilities. CCR5 HDGNR10 DNA are useful 

CC for chromosome identification and in gene therapy. CCR5 HDGNR10 DNA, 

CC protein, antibodies, agonists and antagonists are also useful in the 

CC diagnosis, treatment and prevention of cancer (breast, ovary, adrenal 

CC gland, bone, bone marrow, gastrointestinal tract, liver, lung, 

CC urogenital); immune disorders (Addison's disease, allergies, autoimmune 

CC haemolytic anaemia, autoimmune thyroiditis, diabetes mellitus, Crohn's 

CC disease, multiple sclerosis, rheumatoid arthritis and ulcerative colitis) 

CC ; cardiovascular disorders (myocardial ischaemias) and wound healing. The 

CC present sequence is human CCR5 HDGNR10 protein 

XX 

SQ Sequence 352 AA; 

Query Match 42.9%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 20 MD YQ VS SPIYDINYYTSEPCQKI N VKQ I AA 49 

I 1 I I II I I I I I I I I I I I I I I I I I I I II I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I II I I I 

Db 119 IILLTIDRYLAVAmAVFALKARTWFGVWSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I II I I I I I I I II I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I II I I I I I I I I I I I : I 

Db 239 TIMI VYFLFWAPYNIVLLLNTFQEFFGLNNCS S SNRLDQAMQV 281 


RESULT 9 
AAG80111 

ID AAG80111 standard; protein; 352 AA. 
XX 

AC AAG80111; 
XX 


DT 17-JAN-2002 (first entry) 
XX 

DE Human CCR5 protein. 
XX 

KW Chemokine; tumour diagnosis; colorectal; prostatic; organ rejection; 

KW inflammation; autoimmune disease; metastasis; bronchial asthma; lupus; 

KW chronic bowel inflammation; rheumatoid arthritis; cytostatic; 

KW antiinflammatory; antiasthmatic; immunosuppressive; dermatological ; 

KW antirheumatic; antiarthritic . 

XX 

OS Homo sapiens. 
XX 

PN WO200172830-A2. 
XX 

PD 04-OCT-2001. 
XX 

PF 02-APR-2001; 2 001WO-EP0037 08 . 
XX 

PR 31-MAR-2000; 2000DE-01016013 . 
XX 

PA (IPFP-) IPF PHARM GMBH. 

PA (FORS/) FORSSMANN U. 
XX 

PI Forssmann W, Adermann K, Heitland A, Spodsberg N; 
XX 

DR WPI; 2001-626256/72. 
XX 

PT Diagnostic agent containing two or more receptor-specific ligands, useful 

PT for detecting tumors, inflammation etc., also therapeutic use of ligand 

PT inhibitors . 
XX 

PS Disclosure; Page 10; 26pp; German. 
XX 

CC This invention describes a novel diagnostic agent (A) comprising at least 

CC two different ligands (I) for receptors (II) that are implicated in 

CC disease. (A) are used for the diagnosis of tumors (especially colorectal 

CC or prostatic), organ rejection, inflammation and autoimmune diseases. 

CC Also inhibitors of (I) are used therapeutically against tumors (and their 

CC metastases), inflammation (particularly bronchial asthma or chronic bowel 

CC inflammation), or autoimmune diseases (rheumatoid arthritis or lupus), 

CC where the ( cardio) vascular , lymphatic, respiratory, nervous, digestive, 

CC endocrine, motor or urogenital systems or skin are affected, and bone 

CC marrow diseases. The products of the invention are chemokine derivatives 

CC which have cytostatic, antiinflammatory, antiasthmatic, 

CC immunosuppressive, dermatological, antirheumatic, antiarthritic. 

CC Chemokines act on specific tumor and inflammatory cells through a 

CC constellation of chemokine receptors (CR) , which control migration and 

CC proliferation of these cells. AAG8 004 5-AAG8 012 8 represent human chemokine 

CC fragments used to illustrate the method of the invention 

XX 

SQ Sequence 352 AA; 

Query Match 42.9%; Score 379; DB 4; Length 352; 
Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 


Qy 


20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA- 


49 


Db 


I I I I II I I I I I I I I I I I II I I I I I I I I I I I 
1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 5 0 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

II I I I I I I I II I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I II I I I I I I 

Db 119 1 1 L L T I DRY LAWHAVFAL KART VT FGWT S VI TWWAVFAS LPGIIFTRS Q KE GLH YT C 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I 1 I I I I I 1 I I I I i I I II I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I II I II II II I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 10 

AAE04321 

ID 

AAE04321 standard; protein; 352 AA. 

XX 


AC 

AAE04321; 

XX 


DT 

04-SEP-2001 (first entry) 

XX 


DE 

Human chemokine receptor (CKR) , CC-CKR-5 related protein #2. 

XX 


KW 

Human; transformed mammalian cell; CD4; reporter gene; translocation; 

KW 

human immuno deficiency virus; HIV; long terminal repeat; LTR; therapy; 

KW 

chemokine receptor; CKR; cellular dysfunction; HIV infection; cofactor; 

KW 

CC-CKR-5; envelope glycoprotein; anti-HIV. 

XX 


OS 

Homo sapiens . 

XX 


PN 

US6258527-B1. 

XX 


PD 

10-JUL-2001. 

XX 


PF 

21-MAY-1997; 97US- 008 61105 . 

XX 


PR 

2 0-MAY-1996; 96US-0017157P . 

PR 

19-JUN-1996; 96US-0020043P . 

PR 

19-MAY-1997; 97US- 00858660 . 

XX 


PA 

(AARO-) AARON DIAMOND AIDS RES CENT. 

PA 

(UYNY ) UNIV NEW YORK STATE. 

XX 


PI 

Littman DR, Deng H, Ellmeier W, Landau NR, Liu R; 

XX 


DR 

WPI; 2001-417127/44. 

DR 

N-PSDB; AAD08577. 

XX 


PT 

Transformed mammalian cell (I) that contains a CD4 gene, reporter gene 

PT 

and HIV LTR for identification of drugs and antibodies for treatment of 


PT HIV. 
XX 

PS Disclosure; Col 47-50; 37pp; English. 
XX 

CC The present invention relates to a transformed mammalian cell that 

CC contains a gene encoding CD4, a construct encoding a reporter gene under 

CC the regulation of an human immuno deficiency virus (HIV) long terminal 

CC repeat (LTR) and that has been transduced with a vector encoding a human 

CC chemokine receptor (CKR) where the CD4 and the CKR are present on the 

CC cell surface of transformed mammalian cell. The invention is useful for 

CC identifying drugs or antibodies that interfere with the translocation of 

CC HIV into transformed mammalian cell or for identifying a human chemokine 

CC receptor that facilitates the infection of a particular HIV strain into 

CC the transformed mammalian cell. Compounds identified can be used to treat 

CC cellular dysfunction and to prevent or combat HIV infection. The present 

CC sequence is a human chemokine receptor (CKR) , CC-CKR-5 related protein. 

CC CC-CKR-5 is the principal cofactor for entry mediated by the envelope 

CC glycoproteins of primary macrophage-tropic strains of HIV-1 
XX 

SQ Sequence 352 AA; 


Query Match 42.9%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 15 0; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MD YQVS S P I YD I N Y YT S E P CQKI NVKQ I AARL L P P L YS LVFI FGFVGNMLVI L I L I NC KR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 1 1 LLT I DRYLAWHAVFALKART VT FGWT SVI TWWAVFAS LPGI I FTRSQKEGLH YT C 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I II I I I II I I : : I 

Db 17 9 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRL IF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 

RESULT 11 
AAE07039 

ID AAE07039 standard; protein; 352 AA. 
XX 

AC AAE07039; 
XX 

DT 16-OCT-2001 (first entry) 
XX 

DE Human G-protein chemokine receptor ( CCR5 ) HDGNR10 protein #2. 
XX 

KW Human; G-protein chemokine receptor; CCR5; HDGNR10; inflammation; HIV; 


KW human immunodeficiency virus; antimicrobial; vasodilator; vulnerary; 

KW cytostatic; immunosuppressive; nootropic; neuroprotective; gene therapy; 

KW neurodegenerative disorder; Kaposi's sarcoma; autoimmune disease; 

KW rheumatoid arthritis; cancer; breast; ovary; adrenal gland; bone marrow; 

KW gastrointestinal tract; lung; liver; immune disorder; Addison's disease; 

KW haemolytic anaemia; autoimmune thyroiditis; diabetes mellitus; allergy; 

KW multiple sclerosis; ulcerative colitis; Crohn's disease; wound healing; 

KW cardiovascular disorder; myocardial ischaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO200158915-A2 . 
XX 

PD 16-AUG-2001. 
XX 

PF 09-FEB-2001; 2001WO-US004 152 . 
XX 

PR 09-FEB-2000; 2000US-018 1258P . 

PR 09-MAR-2000; 2000US-0187999P . 

PR 22-SEP-2000; 2000US-0234336P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Rosen CA, Roschke V, Li Y, Ruben SM; 
XX 

DR WPI; 2001-488965/53. 

DR N-PSDB; AAD13198 . 
XX 

PT Isolated nucleic acid encoding a human G-protein chemokine receptor 

PT (CCR5) HDGNR10 polypeptide, useful for preventing or treating autoimmune 

PT diseases e.g. rheumatoid arthritis, hyperprolif erative disorders and 

PT neurodegenerative disorders. 

XX 

PS Example 40; Page 486-487; 495pp; English. 
XX 

CC The present sequence is human G-protein chemokine receptor (CCR5) HDGNR10 

CC protein. CCR5 HDGNR10 antibodies are useful for treating, preventing or 

CC ameliorating a disease or disorder associated with inflammation, 

CC defective or aberrant chemotaxis of immune cells, HIV infection (such as 

CC Pneumocystis carinii pneumonia or Kaposi's sarcoma) or defective or 

CC aberrant T-cell antigen presenting cell interaction. The disease or 

CC disorder may also be an infectious disease (e.g. a viral infection such 

CC as an early stage HIV infection, a cytomegalovirus infection, or a 

CC poxvirus infection), an autoimmune disease (e.g. rheumatoid arthritis) or 

CC a neurodegenerative disorder. The disease or disorder may be associated 

CC with aberrant CCR5 expression, lack of CCR5 function, aberrant CCR5 

CC ligand expression, or lack of CCR5 ligand function. CCR5 HDGNR10 protein 

CC is used as a food additive or preservative to increase or decrease 

CC storage capabilities. CCR5 HDGNR10 DNA are useful for chromosome 

CC identification and in gene therapy. CCR5 HDGNR10 DNA, protein, 

CC antibodies, agonists and antagonists are also useful in the diagnosis, 

CC treatment and prevention of cancer (breast, ovary, adrenal gland, bone, 

CC bone marrow, gastrointestinal tract, liver, lung, urogenital); immune 

CC disorders (Addison's disease, allergies, autoimmune haemolytic anaemia, 

CC autoimmune thyroiditis, diabetes mellitus, Crohn's disease, multiple 

CC sclerosis, rheumatoid arthritis and ulcerative colitis) ; cardiovascular 

CC disorders (myocardial ischaemias) and wound healing 


XX 

SQ Sequence 352 AA; 


Query Match 42.9%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVS S P I YDINYYT S EP CQKINVKQ I AARLL P PL YS LVFI FGFVGNMLVI L I LI NCKR 60 

Qy 50 YKCGLC -AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

II I I I I I I I I I I I I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYIAVAmAVFALKARTWFGWTSVITWWAVFASLPGIIFTRSQKEGLHYTC 17 8 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I II I I I I I I I II I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 12 
AAB46858 

ID AAB46858 standard; protein; 352 AA. 
XX 

AC AAB46858; 
XX 

DT 16-AUG-2001 (revised) 

DT 02-AUG-2001 (revised) 

DT 04-MAY-2001 (first entry) 
XX 

DE Human HDGNR10 protein. 
XX 

KW HDGNR10; human; G-protein chemokine receptor; antiinflammatory; 

KW immunomodulatory; anticoagulant; antiallergic; immunosuppressive; 

KW cytostatic; antiparasitic; antipsoriatic; antirheumatic; antiarthritic; 

KW vasotropic; gene therapy; haematopoiesis ; wound healing; coagulation; 

KW angiogenesis ; solid tumour; infection; leukemia; growth factor activity; 

KW T-cell mediated autoimmune disease; psoriasis; allergy; atherogenesis ; 

KW anaphylaxis; malignancy; inflammation; histamine; IgE; silicosis; shock; 

KW immunoglobulin E-mediated allergic reaction; rheumatoid arthritis; 

KW prostaglandin-independent fever; bone marrow failure; sarcoidosis; 

KW hyper-eosinophilic syndrome; vulnerary. 

XX 

OS Homo sapiens. 
XX 

PN US2001000241-A1. 
XX 

PD 12-APR-2001. 
XX 


PF 29-NOV-2000; 2 000US-00725285 . 
XX 


95US-00466343. 
98US-00195662. 
99US-00339912. 


PR 06-JUN-1995; 

PR 18-NOV-1998; 

PR 25-JUN-1999; 
XX 

PA (LIYY/) LI Y. 

PA (RUBE/) RUBEN S M. 

XX 

PI Li Y, Ruben SM; 
XX 

DR WPI; 2001-226317/23, 

DR N-PSDB; AAF26390. 
XX 

PT New human G-protein chemokine receptor polypeptides and polynucleotides, 

PT useful for identifying ( ant ) agonists to the G-protein chemokine receptor. 
XX 

PS Claim la; Page 15; 22pp; English. 
XX 

CC This invention describes a novel receptor polypeptide (I) selected from 

CC (i) a fully defined 329 amino acid sequence (II) fully disclosed in the 

CC specification; and (ii) a polypeptide encoded by the cDNA contained in a 

CC plasmid, and fragments, analogs and derivatives of the polypeptide. The 

CC products of the invention have antiinflammatory, immunomodulatory, 

CC anticoagulant, antiallergic, immunosuppressive, vulnerary, cytostatic, 

CC antiparasitic, antipsoriatic, antirheumatic, antiarthritic and vasotropic 

CC activity and can be used for gene therapy. The G-protein chemokine 

CC receptors, HDGNR10, (I) are useful for screening for compounds which 

CC activate or inhibit activation of (I) . The products of the invention can 

CC also be used for stimulating haematopoiesis, wound healing, coagulation, 

CC angiogenesis, treating solid tumours, chronic infections, leukemia, T- 

CC cell mediated autoimmune diseases, parasitic infections, psoriasis, and 

CC stimulating growth factor activity. HDGNR10 is useful for treating 

CC allergy, atherogenesis , anaphylaxis, malignancy, chronic and acute 

CC inflammation, histamine and immunoglobulin E ( IgE) -mediated allergic 

CC reactions, prostaglandin-independent fever, bone marrow failure, 

CC silicosis, sarcoidosis, rheumatoid arthritis, shock and hyper- 

CC eosinophilic syndrome. (N.B. This record was resubmitted to correct 

CC errors in the keyword formatting) 

XX 

SQ Sequence 352 AA; 


Query Match 42.9%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

QY 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 8 3 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 


Qy 

Db 


84 
119 


97 
178 


Non-endogenous human GPCR protein, SEQ ID NO: 477. 

Human; G protein-coupled receptor; GPCR; non-endogenous; mutant; 
constitutively activated GPCR; agonist; disease. 


Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

M I I I I I I I I I I | | | | M | | . . | 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Q y 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : ! 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 

RESULT 13 
ABB56342 

ID ABB56342 standard; protein; 352 AA. 
XX 

AC ABB56342; 
XX 

DT 18-FEB-2002 (first entry) 
XX 
DE 
XX 
KW 
KW 
XX 

OS Homo sapiens. 
OS Synthetic. 
XX 

PN WO200177172-A2. 
XX 

PD 18-OCT-2001. 
XX 

PF 05-APR-2 001; 2001WO-US011098 . 
XX 

PR 07-APR-2000; 2000US-0195747P . 
XX 

PA (AREN-) ARENA PHARM INC. 
XX 

PI Lehmann-Bruinsma K, Liaw CW, Lin I; 
XX 

DR WPI; 2001-648759/74. 
DR N-PSDB; ABI97978. 
XX 

PT Identifying agonists of G protein-coupled receptors (GPCRs) for use in 
PT disease treatment, comprises contacting candidate compounds with versions 
PT of GPCRs. 
XX 

PS Claim 1; Page 277-278; 394pp; English. 
XX 

CC The invention relates to G protein-coupled receptors (GPCRs) for which 

CC the endogenous ligand has been identified. Non-endogenous constitutively 

CC activated versions of known GPCRs are used in the invention for the 

CC direct identification of candidate compounds as receptor agonists, 

CC inverse agonists or partial agonists. Such agonists are useful as 

CC therapeutic agents for diseases or disorders associated with GPCRs. The 

CC present sequence is a non-endogenous version of a known human GPCR 

XX 

SQ Sequence 352 AA; 


Query Match 42.9%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

QY 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I I M I I I I I I I I I I II I I I I I I M M I I 
Db 1 MDYQVS S PI YDIN YYT S EPCQKINVKQIAARLLPPLYS LVFI FGFVGNMLVI LI LINCKR 60 

Q y 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I II I I I I I : I : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

Db 119 1 1 LLT I DRYLAVVHAVFALKART VT FGWT S VI TWWAVFAS LP GI I FT RS QKEGLH YT C 178 

^ 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I : : 1 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVlylVICYSGILKTLLRCRNEKKRHRAKRLIF 238 

Qy 125 " GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I II I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 

RESULT 14 
AAB83354 

ID AAB83354 standard; protein; 352 AA. 
XX 

AC AAB83354; 
XX 

DT 09-OCT-2001 (first entry) 
XX 

DE Human CCR5 protein sequence. 
XX 
KW 
KW 
XX 

OS Homo sapiens. 
XX 

PN EP1118858-A2 . 
XX 

PD 25-JUL-2001. 
XX 

PF 03-JAN-2001; 2001EP-00300020 . 
XX 

PR 12-JAN-2000; 2000GB-00000659 . 
PR 12-JAN-2000; 2 OOOGB-00000661 . 
PR 12-JAN-2000; 2000GB-00000663 . 
XX 

PA (PFIZ ) PFIZER LTD. 
PA (PFIZ ) PFIZER INC. 
XX 

PI Dobbs S, Perros M, Rickett GA; 
XX 

DR WPI; 2001-477088/52. 
DR N-PSDB; AAF87099. 


Chemotactic chemokine receptor 5; gpl20; CD4; therapy; HIV; CCR5; 
human immunodeficiency virus; anti-inflammatory disease; human. 


XX 

PT Determining if an agent can modulate CCR5-gpl20 interaction, comprises 
PT incubating the agent with CCR5 and g P 12 0 and determining if the agent 
PT modulates the interaction. 
XX 

PS Claim 1; Page 110; 113pp; English. 
XX 

CC This sequence represents the human CCR5 protein sequence. The invention 
CC relates to a method for determining whether an agent is capable of 

modulating the interaction of chemotactic chemokine receptor 5 (CCR5) 
with gpl20, comprising incubating the agent with CCR5 and g P 120 and 
determining whether the agent modulates the interaction, where g P 120 is 
associated with CD4 , and where the interaction is a low affinity binding 
CC The method is used to identify an agent capable of modulating the 
CC interaction of CCR5 with gpl2 0. An agent identified by the method is used 
to prepare a pharmaceutical composition for the treatment of a disease or 
condition associated with CCR5 and g P 120 interaction, to treat a subject 
with a disease or condition associated with CCR5 and gpl20 interaction, 
and for preparing a pharmaceutical for treating human immunodeficiency 
CC virus (HIV). It can also be treat anti-inflammatory diseases. The method 

is commercially useful, amenable to high throughput screening, and 
CC detects interaction of gpl20 with cells expressing only CCR5 
XX 

SQ Sequence 352 AA; 

Query Match 42.9%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

20 MDYQVSSPI YDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAJ^LLPPLYSLVFIFGFVGNMLVILILINCKR 60 

QY 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGI FF 118 


CC 
CC 
CC 
CC 


CC 
CC 
CC 
CC 


Qy 84 

Db 119 


LCTRSQKEGLHYTC 97 
1 1 LLT I DRYLAVVHAVFALKARTVTFG VVT S VITWWAVFAS LPGI I FTRSQKEGLH YTC 178 


QY 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I II I I I | I I I I M . . , 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Q y 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I i I M II I I II I II I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 15 
AAB82948 

ID AAB82948 standard; protein; 352 AA. 
XX 

AC AAB82948; 
XX 

DT 21-DEC-2001 (first entry) 


XX 

cc 
cc 


Human HIV-1 co-receptor CCR5 . 

CCR5; chemokine; co-receptor; human immunodeficiency virus type 1; HIV-1; 
infection; therapy; vaccine; anti-HIV-1. 


XX 
DE 
XX 
KW 
KW 
XX 

OS Homo sapiens. 
XX 

FH Ke Y Location/Qualifiers 
FT Binding-site 2. .18 

FT /note= "binds to HIV-1 gpl20" 

XX 

PN WO200164710-A2. 
XX 

PD 07-SEP-2001. 
XX 

PF 28-FEB-2001; 2001WO-US006699 . 
XX 

PR 29-FEB-2000; 2000US-0185667P . 
PR 19-MAY-2000; 2 00OUS-02 05839P . 
PR 07-FEB-2001; 2 001US-02 67231P . 
XX 

PA (PROG-) PROGENICS PHARM INC. 

PA (AARO-) AARON DIAMOND AIDS RES CENT. 

XX 

PI Dragic T, Olson WC; 
XX 

DR WPI; 2001-611273/70. 
DR N-PSDB; AAH26903. 
XX 
PT 


Novel compounds comprising specific amino acids within CCR5 (HIV 1 co- 
PT receptor) amino terminal domain including negatively charged and two 
PT sulfated tyrosine residues is useful for treating HIV infection in 
PT humans . 


XX 

PS Claim 1; Page 30; 163pp; English. 


The present sequence is that of human HIV-1 co-receptor CCR5 . Amino acids 
2-18 in the N-terminal region of CCR5 comprise an HIV-1 gpl2 0-binding 
CC site that determines the specificity of the interaction between CCR5 and 
CC HIV-1 gpl20. Post-translational sulfation of the tyrosine residues in the 
CC CCR5 N-terminus is required for gpl20 binding and may critically modulate 
CC the susceptibility of target cells to HIV-1 infection in vivo. The 

invention provides claimed sulfated peptides (see AAB82947) that are 
based on the CCR5 N-terminal region and which are effective for 


CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 

j. j — r — — — ~ \ uu^ia^cuun, ilhz: unwia / 

CC infected with HIV (prophylactic method), or in a subject who is not 

CC infected with, but has been exposed to, HIV 
XX 

SQ Sequence 352 AA; 


inhibiting HIV-1 binding to CCR5 . These peptides are used in claimed 
methods of inhibiting HIV infection of CD4 + cells, of preventing CD4 + 
cells from becoming infected with HIV, of treating a subject whose CD4 + 
cells are infected with HIV, and of identifying an agent which inhibits 
binding of a CCR5 ligand to a CCR5 receptor. The methods may be carried 
out in a subject, especially a human, infected (therapeutic method), not 


Query Match 


42.9%; Score 379; DB 4; Length 352; 


Best Local Similarity 35.0%; Pred. No. 5.7e-31; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 


Qy 

20 

Db 

1 

Qy 

50 

Db 

61 

Qy 

84 

Db 

119 

Qy 

98 

Db 

179 

Qy 

125 

Db 

239 


1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 


-YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : | 


-LCTRSQKEGLHYTC 97 
: I I I I I I I I I I I I 


FY5QYQFWKNFQTLKI HQRVHGG 124 

I I I I M I I I I I I I I I I I I II : : | 


-GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I M I I I I : I 


Search completed: September 17, 2004, 16:45:10 
Job time : 93.539 sees 


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 


Run on : 


Title: 

Perfect score: 
Sequence : 

Scoring table: 


September 17, 2004, 16:42:06 ; Search time 24.4814 Seconds 

(without alignments) 
331.080 Million cell updates/sec 

US-10-057-890A-31 
884 

1 MKVSVAALSCLMLVTALGSM GLNNCSSSNRLDGHQRVHAA 157 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 . 


Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 


389414 


Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 


Database : Issued_Patents_AA: * 

1: /cgn2_6/ptodata/2/iaa/5A_COMB.pep: * 

2 : /cgn2_6/ptodata/2/iaa/5B_COMB.pep: * 

3 : /cgn2_6/ptodata/2/iaa/6A_COMB. pep : * 

4 : /cgn2_6/ptodata/2/iaa/6B_COMB.pep:* 

5 : /cgn2_6/ptodata/2/iaa/PCTUS_COMB . pep : * 

6 : /cgn2_6/ptodata/2/iaa/backf ilesl . pep : * 


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 


1 
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9 
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3 

us- 

09- 

087- 
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Sequence 

13, Appl 

2 

379 

42. 

9 

352 

3 

us- 

08- 

861- 
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Sequence 

14, Appl 

3 

379 

42. 

9 
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3 

us- 

08- 

575- 
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Sequence 

2, Appli 

4 

379 

42. 

9 

352 

4 

us- 

08- 

833- 

752-5 

Sequence 

5, Appli 

5 

379 

42. 

9 
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4 

us- 

09- 

502- 
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Sequence 

2, Appli 

6 

379 
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9 
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4 
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Sequence 
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7 
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3 
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3 

us- 

09- 
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Sequence 

52, Appl 
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3 
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4 

us- 

09- 

534- 
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Sequence 

52, Appl 
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2 
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3 

us- 

08- 

466- 

343D-2 

Sequence 

2, Appli 

10 
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6 
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4 

us- 

09- 

517- 

605-5 

Sequence 

5, Appli 

11 
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30. 

8 
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4 

us- 

08- 

724- 
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Sequence 

2, Appli 


12 

258 

29 . 

2 
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4 

us- 

08- 

-833- 

-752-4 

Sequence 

4, Appli 
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29 . 

2 
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3 

us- 

09- 

-087- 

-232A-17 

Sequence 
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4 

us- 

08- 

-833- 
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Sequence 

6, Appli 
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us- 

09- 

-087- 
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Sequence 

15, Appl 

16 
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0 

87 

3 

us- 

09- 

-087- 
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Sequence 

18, Appl 

17 

148.5 

16. 

8 
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2 

us- 

08- 

933- 

750C-17 

Sequence 

17, Appl 

18 

14 8.5 

16 . 

8 

488 

3 

us~ 

09- 

-234- 

613-17 

Sequence 

17, Appl 

19 

128 

14 . 

5 

54 

4 

us- 

08- 

-833- 

752-11 

Sequence 

11, Appl 

20 

128 

14 . 

5 

313 

4 

us- 

09- 

-800- 

729-196 

Sequence 

196 f App 

21 

128 

14 . 

5 

326 

4 

us- 

09- 

800- 

729-195 

Sequence 

195, App 

22 

123 . 5 

14 . 

0 
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4 

us- 

09- 

881- 

578A-2 

Sequence 

2, Appli 

23 
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13 . 

9 

22 

3 

us- 

08- 

861- 

105-4 

Sequence 

4, Appli 

24 
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7 
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4 

us- 

09- 

881- 

578A-4 

Sequence 

4, Appli 

25 

121 

13. 

7 

711 

2 

us- 

08- 

820- 

17 OA- 10 

Sequence 

10, Appl 

26 
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13 . 

7 

711 

3 

us- 

09- 

055- 

699-10 

Sequence 

10, Appl 

27 
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13 , 

7 

711 

3 

us- 

09- 

273- 

5.65-10 

Sequence 

10, Appl 

28 

121 

13 . 

7 

711 

4 

us- 

09- 

565- 

538-10 

Sequence 

10, Appl 

29 

121 

13. 

7 

711 

4 

us- 

09- 

661- 

468-10 

Sequence 

10, Appl 

30 

121 

13. 

7 

711 

4 

us- 

09- 

976- 

165-10 

Sequence 

10, Appl 

31 

120. 5 

13. 

6 

185 

4 

us- 

09- 

494- 
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Sequence 

125, App 

32 

119. 5 

13. 

5 

185 

4 

us- 

09- 

494- 

190-126 

Sequence 

126, App 

33 

118. 5 

13. 

4 

315 

1 

us- 

08- 

253- 

155A-34 

Sequence 

34, Appl 

34 

118 . 5 

13 . 

4 

532 

4 

us- 

09- 

389- 

956-80 

Sequence 

80, Appl 

3,5 

117 

13 . 

2 

457 

4 

us- 

09- 

389- 

956-68 

Sequence 

68, Appl 

36 

117 

13. 

2 

647 

4 

us- 

09- 

389- 

956-6 

Sequence 

6, Appli 

37 

114 . 5 

13. 

0 

543 

4 

us- 

09- 

362- 

123A-4 

Sequence 

4, Appli 

38 

114 . 5 

13. 

0 

675 

1 

us- 

08- 

317- 

52 2 A- 9 

Sequence 

9, Appli 

39 

114 . 5 

13. 

0 

675 

1 

us- 

08- 

439- 

818A-9 

Sequence 

9, Appli 

40 

114 . 5 

13 . 

0 

675 

2 

us- 

08- 

751- 

965-9 

Sequence 

9, Appli 

41 

114.5 

13. 

0 

675 

2 

us- 

08- 

738- 

975-9 

Sequence 

9, Appli 

42 

114.5 

13. 

0 

675 

2 

us- 

08- 

728- 

626-9 

Sequence 

9, Appli 

43 

114.5 

13. 

0 

675 

3 

us- 

08- 

808- 

599A-9 

Sequence 

9, Appli 

44 

114 

12. 

9 

21 

3 

us- 

08- 

907- 

468-11 

Sequence 

11, Appl 

45 

113.5 

12. 

8 

462 

3 

us- 

08- 

486- 

099-117 

Sequence 

117, App 


ALIGNMENTS 


RESULT 1 

US-09-087-232A-13 

; Sequence 13, Application US/09087232A 

; Patent No. 6153431 

; GENERAL INFORMATION: 

APPLICANT: Quillent et al. 
; TITLE OF INVENTION: HUMAN IMMUNODEFICIENCY VIRUS CO-RECEPTOR 

TITLE OF INVENTION: VARIANTS ASSOCIATED WITH RESISTANCE TO VIRUS 
INFECTION. 

NUMBER OF SEQUENCES: 23 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Baker & Botts, L.L.P. attn. Lisa Kole 

STREET: 30 Rockefeller Plaza 
; CITY: New York 

; STATE: New York 

; COUNTRY: USA 

ZIP: 10112 
; COMPUTER READABLE FORM: 


MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS /MS- DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/087 , 232A 
FILING DATE: 28 MAY 1998 
; CLASSIFICATION: 435 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/048,057 
FILING DATE: 30 MAY 1997 
; ATTORNEY/ AGENT INFORMATION: 
NAME: KOLE, LISA B. 
REGISTRATION NUMBER: 35,225 
REFERENCE/DOCKET NUMBER: AP 31115 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 408-2628 
TELEFAX: (212) 765-2519 
; INFORMATION FOR SEQ ID NO: 13: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 352 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-087-232A-13 

Query Match 42.9%; Score 37 9; DB 3; Length 352; 

Best Local Similarity 35.0%; Pred. No. 8.1e-35; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 1 1 LLT I D R YLAWHAVFAL KART VT FGWT S VI TWWAVFAS L P G 1 1 FT RS QKEGLH YT C 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 12 4 

I I I I I I I I I I I I I I I I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I II I I I I I I I II I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 2 

US-08-861-105-14 

; Sequence 14, Application US/08861105 

; Patent No. 6258527 

; GENERAL INFORMATION: 

; APPLICANT: LITTMAN, DAN R. 


APPLICANT: DENG, HONGKUI 
APPLICANT: ELLMEIER, WILFRIED 
APPLICANT: LANDAU, NATHANIEL R. 
APPLICANT: LIU, RONG 

TITLE OF INVENTION: G-COUPLED RECEPTORS ASSOCIATED WITH 

TITLE OF INVENTION: MACRO PHAGE- TROPHIC HIV, AND DIAGNOSTIC AND THERAPEUTIC 
TITLE OF INVENTION: USES THEREOF 
NUMBER OF SEQUENCES: 14 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: David A. Jackson, Esq. 

STREET: 411 Hackensack Ave, Continental Plaza, 4th 
STREET: Floor 
CITY: Hackensack 
STATE: New Jersey 
COUNTRY: USA 
ZIP : 07601 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/861, 105 
FILING DATE: 
CLASSIFICATION: 436 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/666,020 
FILING DATE: 19-JUN-1996 
CLASSIFICATION: 436 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 08/227,319 
FILING DATE: 13-APR-1994 
ATTORNEY/AGENT INFORMATION: 

NAME : Jackson Esq., David A. 
REGISTRATION NUMBER: 26,742 
REFERENCE/ DOCKET NUMBER: 104 9-1-004 Nl 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 2 01-487-58 00 
TELEFAX: 201-343-1684 
INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 352 amino acids 
TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
HYPOTHETICAL: NO 
ORIGINAL SOURCE: 

ORGANISM: Homo sapiens 
US-08-861-105-14 

Query Match 42.9%; Score 379; DB 3; Length 352; 

Best Local Similarity 35.0%; Pred. No. 8.1e-35; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 


20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA- 
M I I I I I II I I I II I I I I I I I I I I I I I I I I 


49 


Db 


1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 

50 

Db 

61 

Qy 

84 

Db 

119 

Qy 

98 

Db 

179 

Qy 

125 

Db 

239 


-YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I It | : | : : | 


-LCTRSQKEGLHYTC 97 
: I I I I I I I I I I I I 


5SHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I II I I I I I I I | | | M | | : : I 


-GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I II I I I I I I I I I II : I 


RESULT 3 

US-08-575-967A-2 

; Sequence 2, Application US/08575967A 
; Patent No. 6265184 

GENERAL INFORMATION: 

APPLICANT: Gray et al . 

TITLE OF INVENTION: Chemokine Receptor Materials and Methods 
NUMBER OF SEQUENCES: 16 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & Borun 
STREET: 6300 Sears Tower, 233 S. Wacker Drive 
; CITY: Chicago 

STATE: Illinois 
; COUNTRY: USA 

ZIP: 60606 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/575, 967A 

; FILING DATE: 

CLASSIFICATION: 435 
; ATTORNEY/AGENT INFORMATION: 
; NAME: No. 6265184and, Greta E. 

REGISTRATION NUMBER: 35,302 

REFERENCE/ DOCKET NUMBER: 32918 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 206-485-1900 

TELEFAX: 206-485-1662 
; INFORMATION FOR SEQ ID NO: 2: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 352 amino acids 

; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
; FEATURE : 


; NAME/ KEY : misc_feature 

; OTHER INFORMATION: /= "88C amino acid sequence" 

US-08-575-967A-2 

Query Match 42.9%; Score 379; DB 3; Length 352; 

Best Local Similarity 35.0%; Pred. No. 8.1e-35; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

QY 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I I I I I II I I I I I I I I 

Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Q y 50 — YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 8 3 

I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 8 4 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 1 1 LLT I D R YLAWHAVFAL KART VT F GWT S VI T WWAVFAS LPGIIFTRS Q KE GLH YT C 17 8 

QY 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I :: I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 4 
US-08-833-752-5 

Sequence 5, Application US/08833752 
Patent No. 6448375 
GENERAL INFORMATION: 

APPLICANT: SAMSON, MICHEL 
APPLICANT: PARMENTIER, MARC 
APPLICANT: VASSART, GILBERT 
APPLICANT: LIBERT, FREDERICK 

TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 
TITLE OF INVENTION: AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 620 Newport Center Drive 16th Floor 
CITY: Newport Beach 
STATE : CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8/ 833 , 752 
FILING DATE: 9-APR-1997 
CLASSIFICATION: 536 


ATTORNEY/AGENT INFORMATION: 
NAME: Altaian, Daniel E 
REGISTRATION NUMBER: 34,115 
; REFERENCE/ DOCKET NUMBER: 

INFORMATION FOR SEQ ID NO: 5: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 352 amino acids 

TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-752-5 

Query Match 42.9%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 8.1e-35; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPI YDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

QY 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 8 3 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAVAmAVFALKARTWFGVWSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I 1 I I I I I I I I I I I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

QY 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I II I I I I I I I I II I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 5 

US-09-502-783A-2 

; Sequence 2, Application US/09502783A 

; Patent No. 6511826 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven M. 

; TITLE OF INVENTION: Polynucleotides Encoding Human G-Protein Chemokine 
Receptor (CCR5) 

; TITLE OF INVENTION: HDGNR10 
; FILE REFERENCE: 1488.1150006 

CURRENT APPLICATION NUMBER: US/ 09/5 02 , 7 83A 
; CURRENT FILING DATE: 2001-08-23 
; PRIOR APPLICATION NUMBER: 08/466,343 
; PRIOR FILING DATE: 1995-06-06 
; NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: Patentln version 3.0 
; SEQ ID NO 2 
; LENGTH: 352 
TYPE: PRT 


; ORGANISM: Homo sapiens 
US-09-502-783A-2 


Query Match 42.9%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 8.1e-35; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I II I I I I I I : I : : | 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I ! II I I I I I I 

Db 119 1 1 LLT I DRYLAVVHAVFALKARTVT FGWT S VI TWVVAVFAS L PGI I FTRSQKEGLH YT C 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I II I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 6 
US-09-796-202-1 

; Sequence 1, Application US/09796202 

; Patent No. 6548636 

; GENERAL INFORMATION: 

; APPLICANT: Dragic, Tat j ana 

; APPLICANT: Olson, William 

; TITLE OF INVENTION: SULFATED CCR5 PEPTIDES FOR HIV-1 INFECTION 

; FILE REFERENCE: 2048/61010/ JPW/SHS 

; CURRENT APPLICATION NUMBER: US/ 09/7 96, 2 02 

; CURRENT FILING DATE: 2001-02-28 

; NUMBER OF SEQ ID NOS : 17 

; SOFTWARE: Patentln version 3.0 

; SEQ ID NO 1 

LENGTH: 352 

TYPE: PRT 

ORGANISM: human 
US-09-796-202-1 

Query Match 42.9%; Score 379; DB 4; Length 352; 

Best Local Similarity 35.0%; Pred. No. 8.1e-35; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I I I I I I I I I I ! II I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

QY 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I II I I I I I I I I I I : I : : I 


Db 


61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 


Qy 8 4 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db H9 IILLTIDRYIAVVHAVFALKARTVTFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

QY 98 S SHFP YSQYQFWKNFQTLKI HQRVHGG-- 124 

I I I II I I I I I I I I I I I I I I I : : | 

Db 179 S SHFP YSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

QY 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I i I i I I I I I I I I I I I I I : I 

239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 7 

US-09-045-583-52 

; Sequence 52, Application US/09045583 
; Patent No. 6287805 

GENERAL INFORMATION: 

APPLICANT: Graham, Gerard J. et al . 

TITLE OF INVENTION: No. 6287805el Molecules of the G Protein-Coupled 
NUMBER OF SEQUENCES: 5 6 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: LAHIVE & COCKFIELD, LLP 
STREET: 28 State Street 
; CITY: Boston 

STATE: Massachusetts 
; COUNTRY: USA 

ZIP: 02109 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: Patentln Release #1.0, Version #1.25 

; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/045, 583 
; FILING DATE: 20-MAR-98 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
FILING DATE: 
; ATTORNEY/AGENT INFORMATION: 
; NAME: Mandragouras , Amy E. 

; REGISTRATION NUMBER: 36,207 

REFERENCE/ DOCKET NUMBER: MNI-044 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (617)22 7-74 00 
TELEFAX: (617)742-4214 
INFORMATION FOR SEQ ID NO: 52: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 352 amino acids 

TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
FRAGMENT TYPE: internal 
US-09-045-583-52 


Query Match 42.3%; Score 374; DB 3; Length 352; 

Best Local Similarity 34.6%; Pred. No. 3e-34; 

Matches 98; Conservative 8; Mismatches 27; Indels 150; Gaps 7; 

20 MDYQVSSPI YDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I I I : I I I I II I I I I I II I I I I 
1 MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG— 83 

I I I II I I I II I I I I I : I : : I 

61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

84 LCTRSQKEGLHYTC 97 

: MINIMUM 

119 IILLTIDRYIATVTiAVFALKARTVTFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

M II II II II I I II II II II :: I 

179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I II II II II I I I II : I 

239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


US-09-534-185-52 

; Sequence 52, Application US/09534185 
; Patent No. 6403767 

GENERAL INFORMATION: 

APPLICANT: Graham, Gerard J. et al . 

TITLE OF INVENTION: No. 6403767el Molecules of the G Protein-Coupled 
; Heptahelical Receptor Superfamily and Uses 

; Therefor 
; NUMBER OF SEQUENCES: 56 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: LAHIVE & COCKFIELD, LLP 

STREET : 28 State Street 
; CITY: Boston 

; STATE: Massachusetts 

COUNTRY: USA 
; ZIP: 02109 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/534,185 

FILING DATE: 24-Mar-2000 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/045,583 

FILING DATE: <Unknown> 
; ATTORNEY/AGENT INFORMATION: 

NAME: Mandragouras, Amy E. 


Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

RESULT 8 


REGISTRATION NUMBER: 36,207 
REFERENCE/DOCKET NUMBER: MNI-044 
TELECOMMUNICATION INFORMATION : 
TELEPHONE: (617)227-7400 
TELEFAX: (617)742-4214 
; INFORMATION FOR SEQ ID NO: 52: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 352 amino acids 

TYPE: amino acid 
; TOPOLOGY: linear 

MOLECULE TYPE: peptide 
FRAGMENT TYPE: internal 
SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
US-09-534-185-52 


Query Match 42.3%; Score 374; DB 4; Length 352; 

Best Local Similarity 34.6%; Pred. No. 3e-34; 

Matches 98; Conservative 8; Mismatches 27; Indels 15 0; Gaps 7; 

MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I I I : I II I I I I I I I I I I I I I I 


Qy 

20 

Db 

1 

Qy 

50 

Db 

61 

Qy 

84 

Db 

119 

Qy 

98 

Db 

179 

Qy 

125 

Db 

239 


-YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I II I I I I I I : I : : I 


-LCTRSQKEGLHYTC 97 
: I I I I I I I I I I I I 


SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I II I I I I I I I I I : : I 


-GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 


RESULT 9 

US-08-466-343D-2 

; Sequence 2, Application US/08466343D 
; Patent No. 6025154 
; GENERAL INFORMATION: 
APPLICANT: LI, Yi 

TITLE OF INVENTION: POLYNUCLEOTIDES ENCODING HUMAN G-PROTEIN 
; TITLE OF INVENTION: CHEMOKINE RECEPTOR HDGNR10 (AS AMENDED) 

; NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: STERNE, KESSLER, GOLDSTEIN & FOX P.L.L.C. 
STREET: 1100 NEW YORK AVE., NW, SUITE 600 
CITY: WASHINGTON 
STATE: DC 
COUNTRY: USA 
ZIP: 20005 
COMPUTER READABLE FORM: 


; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC- DOS /MS -DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.30 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/4 66, 343D 
; FILING DATE: 06-JUN-1995 

; CLASSIFICATION: 435 

ATTORNEY/AGENT INFORMATION: 
NAME: STEFFE, ERIC K. 
; REGISTRATION NUMBER: 36,688 

REFERENCE/DOCKET NUMBER: 1488 . 1150000/EKS/KLM 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (202) 371-2600 

TELEFAX: (202) 371-2540 
; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 352 amino acids 
TYPE: amino acid 
; TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-08-466-343D-2 


Query Match 42.2%; Score 373; DB 3; Length 352; 

Best Local Similarity 34.6%; Pred. No. 3.9e-34; 

Matches 98; Conservative 7; Mismatches 28; Indels 150; Gaps 7; 

QY 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I i I i I I I I I I I I I I I I I I I I I I I I M I 

Db 1 MDYQVSSPIYDINYYTSEPCPKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCQR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LESMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

QY 84 LCTRSQKEGLHYTC 97 

Db 119 1 1 LLT I DRYLAI VHAVFALKART VT FGWT S VI TWWAVFAS LP GI I FTRSQKEGLH YT C 178 

QY 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I : ; 1 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLvTWICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I M I I I I I I I II I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 10 
US-09-517-605-5 

; Sequence 5, Application US/09517605 

; Patent No. 6391567 

; GENERAL INFORMATION: 

; APPLICANT: Littman, Dan R. 

; APPLICANT: Kwon, Douglas S. 

; APPLICANT: van Kooyk, Yvette 

; APPLICANT: Geijtenbeck, Tneo 


TITLE OF INVENTION: METHODS OF USING A FACILITATOR OF RETROVIRAL ENTRY INTO 
TITLE OF INVENTION: CELLS 
FILE REFERENCE: 1049-1-017 

CURRENT APPLICATION NUMBER: US/ 09/517 , 605 
CURRENT FILING DATE: 2000-03-02 
NUMBER OF SEQ ID NOS : 17 
SOFTWARE: PatentlnVer. 2.0 
SEQ ID NO 5 
LENGTH: 352 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-517-605-5 

Query Match 40.6%; Score 359; DB 4; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.5e-32; 

Matches 96; Conservative 8; Mismatches 29; Indels 150; Gaps 7; 


Qy 

20 

Db 

1 

Qy 

50 

Db 

61 

Qy 

84 

Db 

119 

Qy 

98 

Db 

179 

Qy 

125 

Db 

239 


»t - 1 - j-i^-hn x x x oiLr^i\j.rj vj\<^±Aft. 49 

I I M : I I I I I I I II II I I II I 


- YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 8 3 

I I I I I I I I I I I I I I I : I : : I 


-LCTRSQKEGLHYTC 97 
: I I I I I I I I II II 


FY 3QYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I : : 1 


-GGSYKCGLC QE FFGLNNCS S SNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 


RESULT 11 
US-08-724-984A-2 

; Sequence 2, Application US/08724984A 
; Patent No. 6388055 
; GENERAL INFORMATION: 

APPLICANT: Derk Bergsma, Mary Brawner, and Usman Shabon 
; TITLE OF INVENTION: No. 6388055el Mouse Genomic Clone of the CC- 

TITLE OF INVENTION: CKR5 Receptor 
; NUMBER OF SEQUENCES: 5 

CORRESPONDENCE ADDRESS: 
; ADDRESSEE: SmithKline Beecham Corporation 

STREET: 709 Swedeland Road, P.O. Box 1539 
CITY: King of Prussia 
STATE: PA 
; COUNTRY: USA 

ZIP: 19406-0939 
COMPUTER READABLE FORM: 

MEDIUM TYPE: DISKETTE, 3.5 INCH, 1.44 Mb STORAGE 
COMPUTER: IBM 4 86 


OPERATING SYSTEM: WINDOWS FOR WORKGROUPS 

SOFTWARE: MICROSOFT WORD 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/724, 984A 

FILING DATE: October 3, 1996 

CLASSIFICATION: 800 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 

FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
; NAME: William T. Han 

REGISTRATION NUMBER: 34,344 

REFERENCE/DOCKET NUMBER: ATG50023 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610 270 5024 

TELEFAX: 610 270 5090 
; INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 354 
; TYPE: Amino Acid 

TOPOLOGY: Linear 
US-08-724-984A-2 


Query Match 30.8%; Score 272; DB 4; Length 354; 

Best Local Similarity 27.9%; Pred. No, le-22; 

Matches 80; Conservative 12; Mismatches 29; Indels 166; Gaps 8; 

QY 20 MDYQVSSP — IYDINYYTSEPCQKINVKQIAA ■ 49 

I I : I I I I I I I : I I I I I I I I I I I I I I 
Db 1 ^DFQGSVPTYIYDIDYGMSAPCQKINVKQIAAQLLPPLYSLVFIFGFAGNMMVFLILISC 60 

Q y 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG- 83 

II I I : I I I I I I : I : I I I 

Db 61 KKLKSWDIYLLNLAISDLLFLLTLPFWAHYAANEWIFGNIMC- -KVFTGVYHIGYFGGI 118 

QY 84 LCTRSQKEGLHY 95 

Db 119 FFI I LLT I DRYIA.I VHAVFALKVTTVN FGVI T S VVTWVVAVFAS L PE 1 1 FTRS QKEGFH Y 17 8 

Qy 96 TCSSHFPYSQYQFWKNFQTLK IHQRVHGGGGSYKC 130 

I I I I I I :: I I I I I : I I I I I : | - - | 

Db 179 TCSPHFPHTQYHFWKSFQTLKMVILSLILPLLVMIICYSGILHT LFRCRNEKK 231 

Q y 131 " GLCQEFFGLNNCSSSNRLD 149 

I I I I I I I I I I I I I I I I 

Db 232 RHRAVRLIFAIMIVYFLFWTPYNIVLLLTTFQEFFGLNNCSSSNRLD 278 


RESULT 12 
US-08-833-752-4 

; Sequence 4, Application US/08833752 

; Patent No. 6448375 

; GENERAL INFORMATION: 

; APPLICANT: SAMSON, MICHEL 

APPLICANT: PARMENTIER, MARC 
APPLICANT: VAS S ART , GILBERT 
APPLICANT: LIBERT, FREDERICK 


TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES ' RECEPTOR 
TITLE OF INVENTION: AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES : 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 620 Newport Center Drive 16th Floor 
CITY: Newport Beach 
STATE: CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS /MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/0 8/833,752 
FILING DATE: 9-APR-1997 
CLASSIFICATION: 536 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/DOCKET NUMBER: 
INFORMATION FOR SEQ ID NO: 4: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 184 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-752-4 

Query Match 29.2%; Score 258; DB 4; Length 184; 

Best Local Similarity 34.9%; Pred. No. 1.8e-21; 

Matches 65; Conservative 4; Mismatches 13; Indels 104; Gaps 4; 

MDYQVS S P I YD I N Y YT SEPCQKINVKQIAA 49 

I I M I I I I I I I I I I I I I I I II II I I I I I I I 

MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 

20 

Db 

1 

Qy 

50 

Db 

61 

Qy 

84 

Db 

119 

Qy 

98 

Db 

179 


- YKCGLC AAAQWDFGNTMCQHQRVHGHHHHS YKCG 8 3 

I I I I I II II I I II I I : I : : I 


-LCTRSQKEGLHYTC 97 
: I I I I I I I I I I I I 


I I I I 


RESULT 13 
US-09-087-232A-17 

; Sequence 17, Application US/09087232A 
; Patent No. 6153431 
; GENERAL INFORMATION: 


; APPLICANT: Quillent et al . 

TITLE OF INVENTION: HUMAN IMMUNODEFICIENCY VIRUS CO-RECEPTOR 
TITLE OF INVENTION: VARIANTS ASSOCIATED WITH RESISTANCE TO VIRUS 
INFECTION. 

NUMBER OF SEQUENCES: 23 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Baker & Botts, L.L.P. attn. Lisa Kole 
; STREET: 30 Rockefeller Plaza 

CITY: New York 
STATE: New York 
; COUNTRY: USA 

ZIP: 10112 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/09/ 087 , 232A 

FILING DATE: 28 MAY 1998 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/048,057 
FILING DATE: 30 MAY 1997 
ATTORNEY/ AGENT INFORMATION: 
NAME: KOLE, LISA B. 
REGISTRATION NUMBER: 35,225 
REFERENCE/DOCKET NUMBER: AP 31115 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (212) 408-2628 

TELEFAX: (212) 765-2519 
INFORMATION FOR SEQ ID NO: 17: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 215 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
; MOLECULE TYPE: protein 
US-09-087-232A-17 


Query Match 29.2%; Score 258; DB 3; Length 215; 

Best Local Similarity 34.9%; Pred. No. 2.2e-21; 

Matches 65; Conservative 4; Mismatches 13; Indels 104; Gaps 4; 
QY 2 0 MDYQVS S PI YDIN YYT SEPCQKINVKQIAA 49 

I I I M I I I I I I I II I M I I I I I I I I I I I I I 

Db 1 MDYQVS SPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVI LI LINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

II I I I II I I I I I I I I : I : : | 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGI FF 118 

Qy 84 LCTRSQKEGLHYTC 97 

' II I I I I I I I I II 

D]D 119 1ILLTIDRYLAVVHAVFALKARTVTFGVVTSVI TWWAVFASLPGIIFTRSQKEGLHYTC 17 8 


Qy 


98 SSHFPY 103 
I I I I I I 


Db 


179 SSHFPY 184 


RESULT 14 
US-08-833-752-6 

Sequence 6, Application US/08833752 
Patent No. 6448375 
GENERAL INFORMATION: 

APPLICANT: SAMSON, MICHEL 
APPLICANT: PARMENTIER, MARC 
APPLICANT: VASSART, GILBERT 
APPLICANT: LIBERT, FREDERICK 

TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 
TITLE OF INVENTION: AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 620 Newport Center Drive 16th Floor 
CITY: Newport Beach 
STATE: CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/833 , 752 
FILING DATE: 9-APR-1997 
CLASSIFICATION: 536 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/ DOCKET NUMBER: 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 215 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-833-752-6 

Query Match 29.2%; Score 258; DB 4; Length 215; 

Best Local Similarity 34.9%; Pred. No. 2.2e-21; 

Matches 65; Conservative 4; Mismatches 13; Indels 104; Gaps 4; 

2 0 MD YQVS S P I YDI N YYT SEP CQKI NVKQI AA 49 

M I M I I I I I I I I I I | I I I I I I I I I I I I I I 
Db 1 MDYQVSSPI YDINYYTSEPCQKINVKQIA7VRLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

II I I M I I I I I I I I I : I :: I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 


Db 


119 IILLTIDRYLAVVHAVFALKARTVTFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 17 8 


QY 98 SSHFPY 103 

M I I I I 

Db 179 SSHFPY 184 


RESULT 15 
US-09-087-232A-15 

Sequence 15, Application US/09087232A 
Patent No. 6153431 
GENERAL INFORMATION: 

APPLICANT: Quillent et al. 

TITLE OF INVENTION: HUMAN IMMUNODEFICIENCY VIRUS CO-RECEPTOR 
INFECTION^ ° F INVENTI ° N: VAR ™ S ASSOCIATED WITH RESISTANCE TO VIRUS 
NUMBER OF SEQUENCES : 23 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Baker & Botts, L.L.P. attn. Lisa Kole 
STREET: 30 Rockefeller Plaza 
CITY: New York 
STATE: New York 
COUNTRY: USA 
ZIP: 10112 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 087 , 232A 
FILING DATE: 28 MAY 1998 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/048,057 
FILING DATE: 30 MAY 1997 
ATTORNEY/AGENT INFORMATION: 
NAME: KOLE, LISA B. 
REGISTRATION NUMBER: 35,225 
REFERENCE/DOCKET NUMBER: AP 31115 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 408-2628 
TELEFAX: (212) 765-2519 
INFORMATION FOR SEQ ID NO: 15: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 100 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-09-087-232A-15 

Be^ Y ro at f,- - n ■ 20 ' 6%; SC ° re 182 - 5; 3 ; Length 100; 

Best Local Similarity 43.0%; Pred. No 3e-13- 

Matches 43; Conservative 0; Mismatches ' 4; Indels 53; Gaps 2; 
Q y 20 MDYQVS S PI YDI N YYTS EPCQKINVKQI AA AQ 

UNI III INI I MMM MM | | || Ml 


Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 
2Y 50 YKCGLC AAAQWD FGNTM 66 

100 


Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTM 


Search completed: September 17, 2004, 16:49:04 
Job time : 25.4814 sees 


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 
Run on: 


Title: 

Perfect score: 
Sequence: 


September 17, 2004, 16:42:06 ; Search time 25.5458 Seconds 

(without alignments) 
591.177 Million cell updates/sec 

US-10-057-890A-31 
884 

1 MKVSVAALSCLMLVTALGSM GLNNCS S SNRLDGHQRVHAA 157 


Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 


283366 


Database 


PIR 78:* 


1: 

pirl : * 

2: 

pir2 :* 

3: 

pir3 : * 

4: 

pir4 : * 


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 


Description 


Result 
No. 

Score 

Query 
Match 

Length 

DB 

ID 

1 

379 

42.9 

352 

2 

A43113 

2 

154 

17.4 

201 

2 

157505 

3 

144 

16.3 

693 

2 

137570 

4 

143.5 

16.2 

319 

2 

T46469 

5 

141 

16. 0 

555 

2 

153869 

6 

138.5 

15.7 

223 

2 

F31201 

7 

136.5 

15.4 

348 

2 

138599 

8 

136.5 

15.4 

386 

2 

T12527 

9 

135.5 

15.3 

572 

2 

139311 

10 

134 

15.2 

710 

2 

148668 

11 

133.5 

15.1 

475 

2 

S03679 

12 

133.5 

15.1 

636 

2 

148689 

13 

133.5 

15.1 

701 

2 

T14757 


chemokine (C-C) re 
zinc finger protei 
zinc finger protei 
hypothetical prote 
zinc finger protei 
GLI-related finger 
zinc finger protei 
hypothetical prote 
Kruppel-type zinc 
zinc finger protei 
finger protein (cl 
gene NK10 protein 
hypothetical prote 


14 

-LOl , J 

X 4 

. y 

686 

2 

A34612 

15 

lOl 

X4 

0 
. 0 

261 

2 

S70006 

16 

-L z? . O 

1 A 

X4 

c 
. D 

614 

2 

JH0500 

17 

x ^ o , o 

1 A 

X4 

n 
. 3 

580 

2 

A37107 

18 


1 A 

X4 

A 

469 

2 

138600 

19 

1 97 ^ 

1 A 

14 

. 4 

1173 

2 

150620 

20 

19 7 
xz / 

1 A 

14 

. 4 

728 

2 

A48830 

21 

1 9 6 R 

1 A 

X4 

0 
. 0 

732 

2 

S47073 

22 

1 94 

XZ fi 

1 A 

X4 

. U 

367 

2 

S06582 

23 

1 94 
XZ fi 

1 A 

X4 

. U 

589 

2 

138598 

24 

1 OO c 
j , j 

1 A 

X4 

. U 

209 

2 

S47068 

25 

1 9^ S 
XZ O . 3 

1 A 

X4 

. U 

347 

2 

S00549 

2 6 

1 9^ ^ 
x z 0 . 0 

1 A 

X 4 

. U 

378 

2 

S33994 

27 

1 99 R 
xz z . 0 


r\ 

. y 

488 

2 

S47072 

^. o 

1 99 ^ 
xz z . 0 

X 6 , 


594 

2 

T12488 


1 99 ^ 
XZ Z . 3 

T O 

lo . 

, y 

803 

2 

S26823 

-J u 

xzz 

13 , 

. 8 

1350 

2 

S00647 

O J. 

191 ^ 
X Z X . 0 

lo . 

1 

337 

2 

S60520 


1 9 n ^ 
xz u . o 

1 Q 

lo . 

c 
O 

169 

2 

A39240 


1 9 n ^ 
xz u , 0 

lo . 

c 
D 

229 

2 

A48927 

^4 

ion c 

XZ u . o 

lo . 

6 

292 

2 

S43826 

o o 

ion r 

XZ U . D 

13 . 

6 

399 

2 

S47071 


19 0 R 

lo . 

r- 
D 

546 

2 

149636 

37 

19 0 R 
XZ u . 3 

lo . 

r 
D 

1191 

2 

S35305 

o o 

100 

xz u 

lo . 

6 

435 

2 

S00833 


ion 
Xz U 

lo . 

6 

654 

2 

A57785 

40 


1 ^ 
xo . 

O 

1 C\ A 

xy 4 

2 

153859 

41 

119.5 

13. 

5 

393 

2 

JN0533 

42 

119.5 

13. 

5 

576 

2 

A48157 

43 

119.5 

13. 

5 

671 

2 

JE0288 

44 

118.5 

13. 

4 

107 

2 

139315 

45 

118.5 

13. 

4 

120 

2 

G02493 


zinc finger protei 
finger protein zfo 
zinc finger protei 
spermatogenesis pr 
zinc finger protei 
procKr2 - chicken 
probable trans crip 
finger protein HZF 
finger protein (cl 
zinc finger protei 
finger protein HZF 
developmental cont 
finger protein ZNF 
finger protein HZF 
hypothetical prote 
zinc finger protei 
finger protein - A 
finger protein ZNF 
finger protein mfg 
Kruppel-like zinc 
finger protein OZF 
finger protein HZF 
DNA-binding protei 
zinc finger protei 
finger protein (cl 
finger protein ZNF 
zinc finger protei 
finger protein pML 
renal transcriptio 
krueppel-type zinc 
zinc-finger protei 
KR-ZNF1 - human (f 


ALIGNMENTS 


RESULT 1 
A43113 

chemokine (C-C) receptor 5 - human 
N; Alternate names: C-C CKR-5; CCR5 
C; Species: Homo sapiens (man) 

C;Date: 12-Jul-1996 #sequence_revision 12-Jul-1996 #text change 20-Jun-2000 
C;Accession: A43113; S71808; A58834; A58832; G02653; A58833 
R;Samson, M. ; Labbe, O. ; Mollereau, C; Vassart, G. ; Parmentier, M. 
Biochemxstry 35, 3362-3367, 1996 

Trtllll M ° leCUlar Cl ° nin S * nd Actional expression of a new human CC-chemokine 
xecepuor gene. 

A; Reference number: A43113; MUID: 96241590; PMID: 8639485 
A; Accession: A43113 
A;Molecule type: mRNA 
A; Residues: 1-352 <SAM1> 

A;Cross-references: GB:X91492; NID: gl262810; PIDN: CAA62796 . 1 ; PID:ql262811 
Saraaos^ H - f . L ^ Xt ' ^' ' ^ oranz ' B.J., Rucker, J.; Liesnard, C; Farber, CM.; 
t't^ll ^ S ^ La P° ume ^l^, C; Cognaux, J.; Forceille, C; Muyldermans, G. ; 
R J 1 ; Burtonboy G.; Georges, M. ; Imai, T. ; Rana, S.;Yi, Y. ; Smyth, 

R.J., Collman, R.G.; Doms, R.W.; Vassart, G. ; Parmentier, M. 


Nature 382, 722-725, 1996 

A; Title: Resistance to HIV-1 infection in Caucasian individuals bearing mutant 

alleles of the CCR-5 chemokine receptor gene. 

A; Reference number: S71808; MUID: 96345670 ; PMID: 8751444 

A; Access ion: S718 08 

A; Status: nucleic acid sequence not shown; not compared with conceptual 

translation 

A;Molecule type: DNA 

A; Residues: 182-2 06 ; 207-230 <SAM2> 

A; Accession: A58 834 

A; Status: nucleic acid sequence not shown; not compared with conceptual 

translation 

A;Molecule type: DNA 

A;Residues: 1-184 , ' IKDSHLGAGPAAACHGHLLLGNPKNSASVSK ' <SAM3> 

A;Cross-references: GB:X99393; NID : gl524062 ; PIDN : CAA677 67 . 1 ; PID:gl524063 
A;Note: this frameshift mutation results in a non-functional receptor but 
confers a degree of resistance to HIV-1 infection; it has an allele frequency of 
0.09 or more in some Caucasian populations and may have had a selective 
advantage by conferring resistance to Yersinia plague infections 
R;Combadiere, C. ; Ahuja, S.K.; Tiffany, H.L.; Murphy, P.M. 
J. Leukoc. Biol. 60, 147-152, 1996 

A;Title: Cloning and functional expression of CC CKR5, a human monocyte CC 

chemokine receptor selective for MlP-lalpha, MlP-lbeta, and RANTES . 

A; Reference number: A58832; MUID : 96295970; PMID: 8699119 

A;Accession: A58832 

A; Molecule type: mRNA 

A; Residues: 1-352 <C0M1> 

A;Cross-references: GB:U57840; NID : gl502408 ; PIDN: AAB17071 . 1; PID:gl502409 

A; Experimental source: clone 8, endotoxin-stimulated peripheral blood monocytes 

R;Combadiere, C. 

submitted to the EMBL Data Library, May 1996 
A;Reference number: H01541 
A; Access ion: G02 653 

A; Status: translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-89, ' L ■ , 91-352 <COM2> 
A; Cross-references : EMBL:U57840 

R;Raport, C.J.; Gosling, J.; Schweickart, V.L.; Gray, P.W.; Charo, I F 
J. Biol. Chem. 271, 17161-17166, 1996 

A; Title: Molecular cloning and functional characterization of a novel human CC 

chemokine receptor (CCR5) for RANTES, MlP-lbeta, and MlP-lalpha. 

A; Reference number: A58833; MUID: 96291862 ; PMID: 8663314 

A; Accession : A58 833 

A; Molecule type: mRNA 

A; Residues: 1-352 <RAP> 

A;Cross-references: GB:U54994; NID : gl457945 ; PIDN : AAC50598 . 1 ; PID:gl457946 
C;Comment: This is a receptor for chemokines MlP-lalpha (see PIR:A30574), MIP- 
lbeta (see PIR:A31767), and RANTES (see PIR:A28815). 

C; Comment: Macrophage- and dual-tropic strains of HIV-1 bind to a complex of 
chemokine (C-C) receptor 5 and T-cell surface glycoprotein CD4 (see PIR:RWHUT4) 
C; Genetics: 

A;Gene: GDB : CMKBR5 ; CCR5; CKR-5; CC-CKR-5; CKR5; ChemR13 
A; Cross-references : GDB: 1230510; OMIM: 601373 
A; Map position: 3p21-3p21 
C; Function: 

A;Description: G protein-coupled receptor for chemokines MlP-lalpha, MlP-lbeta 
and RANTES 


A;Note: probably acts to control granulocyte proliferation and differentiation 
C; Superf amily: vertebrate rhodopsin 

C;Keywords: AIDS; G protein-coupled receptor; glycoprotein; phosphoprotein; 
transmembrane protein 

F;32-56/Domain: transmembrane #status predicted <TM1> 
F; 67-87/Domain: transmembrane #status predicted <TM2> 
F; 103-124/Domain: transmembrane #status predicted <TM3> 
F; 142-166/Domain: transmembrane #status predicted <TM4> 
F; 193-2 18 /Domain : transmembrane #status predicted <TM5> 
F;236-257/Domain: transmembrane #status predicted <TM6> 
F;285-300/Domain: transmembrane #status predicted <TM7> 
F;20-269, 101-178/Disulfide bonds: fstatus predicted 

F;268/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F;336,337 342/Binding site: phosphate (Ser) (covalent) #status predicted 
F;340, 343/Binding site: phosphate (Thr) (covalent) #status predicted 

Query Match 42.9%; Score 379; DB 2; Length 352; 

Best Local Similarity 35.0%; Pred. No. 8e-29; 

Matches 99; Conservative 7; Mismatches ^27; Indels 150; Gaps 7; 


Qy 

20 

Db 

1 

Qy 

50 

Db 

61 

Qy 

84 

Db 

119 

Qy 

98 

Db 

179 

Qy 

125 

Db 

239 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


- YKCGLC AAAQWDFGNTMCQHQRVHGHHHHS YKCG 8 3 

I I I I I I I I I I I I I I I : | : : | 


LCTRSQKEGLHYTC 97 
E I LLTI DRYIAVVHAVFALKART VT FGWT S VI TWWAVFAS LPGI I FTRS QKEGLH YTC 178 


fi^x^nAiHryiuM HQRVHGG 124 

I I I I I I If I I I I I I .. | 

SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


-GGSYKCGLC QEFFGLNNCS S SNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 


RESULT 2 
157505 

zinc finger protein - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Aug-1996 #sequence_revision 02-Aug-1996 #text change 05-Nov-1999 
C;Accession: 157505 ~ 
R;Ernoult-Lange, M. ; Kress, M. ; Hamer, D. 
Mol. Cell. Biol. 10, 418-421, 1990 

A;Title: A gene that encodes a protein consisting solely of zinc finger domains 
is preferentially expressed in transformed mouse cells. 
A;Reference number: 157505; MUID: 90097859; PMID:2104662 
A; Accession: 157505 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-201 <RES> 

A/Cross-references: GB:M32057; NID:gl99766; PIDN : AAA39729 . 1; PID:gl99767 


C; Genetics : 
A; Gene: M0K2 


Query Match 17.4%; Score 154; DB 2; Length 201; 

Best Local Similarity 37.0%; Pred. No. 1.6e-07; 

Matches 44; Conservative 13; Mismatches 44; Indels 18; Gaps 6 

Q y 45 KQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTR--SQKEGLHYTCSSHF- 101 

' 1 : 1 1 2 s : : : I I I I I lllll:|| II : | 

Db 60 KPYACEECGMSFSQR SNLHIHQRVH-TGERPYKCGECGKGFSQSSNLHIHRCTHTG 114 

Q y 102 PYSQYQFWKNF QTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 155 

N I • I I I : I I I I I I llll:| | | : : | (Mil 

Db 115 EKPYQCYECGKGFSQSSDLRIHLRVHTGEKPYHCGKCGQGF SQS SKLLIHQRVH 168 


RESULT 3 
137570 

zinc finger protein - human (fragment) 
C; Species: Homo sapiens (man) 

C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text change 01-Dec-2000 
C;Accession: 137570 

R;Abrink, M. ; Aveskogh, M. ; Hellman, L. 
DNA Cell Biol. 14, 125-136, 1995 

A;Title: Isolation of cDNA clones for 42 different Kruppel-related zinc finger 
proteins expressed in the human monoblast cell line U-937. 
A; Reference number: 137566; MUID : 95169271 ; PMID:7865130 
A; Accession: 13757 0 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-693 <RES> 

A;Cross-references: EMBL:X78927; NID:g498726; PIDN : CAA55527 . 1 ; PID:g498727 
C; Genetics: 
A; Gene: HZF4 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 16.3%; Score 144; DB 2; Length 693; 

Best Local Similarity 33.3%; Pred. No. 5.1e-06; 

Matches 39; Conservative 15; Mismatches 33; Indels 30; Gaps 7; 

QY 4 0 QKINVKQIAAYKCGLCAAA-QWDFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCS 98 

II*:: I I I I I = I I I I I I I INI If::: 
Db 554 QKVHTGE-KPYKCGECGKGFKWSLNLDM — HQRVH-TGEKPYTCGAC GKHFSQA 603 

99 SHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 155 

1 : I : : : I : I I | : : | | : | | | 

m 604 s SLQLHQSVHTGEKPYKCDVCGKVF S RS SQLQ YHRRVH 641 

RESULT 4 
T46469 

hypothetical protein DKFZp434G1930 . 1 - human (fragment) 
C; Species: Homo sapiens (man) 

C;Date: 04-Feb-2000 #sequence_revision 04-Feb-2000 #text_change 04-Feb-2000 
C;Accession: T46469 

R;Blum, H.; Bauersachs, S.; Mewes, H.W.; Gassenhuber, J.; Wiemann, S. 
submitted to the Protein Sequence Database, January 2000 


A;Reference number: Z23034 
A; Accession: T4 64 69 
A; Status : preliminary 
A;Molecule type: mRNA 
A; Residues: 1-319 <AAA> 
A;Cross-references : EMBL: AL137483 

A; Experimental source: adult testis; clone DKFZp434G1930 
C; Gene tics: 

A; Note : DKFZp434G1930 . 1 

Query Match 16.2%; Score 143.5; DB 2; Length 319; 

Best Local Similarity 32.8%; Pred. No. 2.6e-06; 

Matches 38; Conservative 15; Mismatches 44; Indels 19; Gaps 7; 

Q y 50 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCSSHF P 102 

Mil : : : : | | | : | I I I | : : | | | 

Db 9 YKCNECEKT-FSHRSSLLSHQRIH-TGEKPYKCNECEKAFSNS — STLIKHLRVHTGEKP 64 

Qy 103 YSQYQFWKNF QTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 155 

1 : 'I 11:111:11 I I I I I :: | | | : : | | | | s | 
Dto 65 YRCRECGKAFSQCSTLTVHQRIHTGEKLYKCGECEKAF NCRA — KLHRHQRIH 115 


RESULT 5 
153869 

zinc finger protein - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 29-May-1998 #sequence_revision 29-May-1998 #text change 01-Dec-2000 
C;Accession: 153869 ~ 
R; Brady, J. P.; Piatigorsky, J. 
Gene 149, 299-304, 1994 

A; Title: A mouse cDNA encoding a protein with zinc- fingers and a KRAB domain 
shows similarity to human prof ilaggrin . 

A;Reference number: 153869; MUID : 950474 92 ; PMID:7959006 
A;Accession: 153869 

A; Status: preliminary; translated from GB/EMBL/DDB J 
A;Molecule type: mRNA 
A; Residues: 1-555 <RES> 

A; Cross-references: GB:L28167; NID:g758660; PIDN : AAA6754 5 . 1 ; PID:g758661 
C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 16.0%; Score 141; DB 2; Length 555; 

Best Local Similarity 23.8%; Pred. No. 7.9e-06; 

Matches 50; Conservative 25; Mismatches 47; Indels 88; Gaps 11; 

Qy 23 QVSSPIYDINYYTSEPCQ KINVKQIAAYKCGLCAAAQWDFGNTMCQH 69 

:| :| : II I I: | : : : : | | | : | : : : | 

Db 303 RVCTP- — VKPYTCEQCEKSLLVDQHLMSHVKVHTRE-RPYNCETCGSA-FSQASHLQDH 357 


Qy 70 QRVH GHHHHS- 


-YKCGLCTRSQKEGLHYTCSSHF- 101 


; 11 Mil!:! : | | | : 

FICSSNLY 411 


Db 358 QRLHTGEKPFKCDACGKSFSRSSHLRSHQRVHTGEKPYKCGECGKS 


Q ^ 102 PYSQYQFWKNF QTLKIHQRVHGGGGSYKCGLCQEFFGLNN 141 

II M :h II :| I M I :| : : ||: 

Db 412 IHQRVHTGEKPYKCVDCGKEFSRPSSLQAHQGIHTGEKSYVCTMCGKGYTLNSNLQVHLR 471 


142 C S S SNRLDGHQRVH 155 

I I I : : I I I I I I 

Db 472 VHTGEKPYSCDVCGKGFSRSSQLQSHQRVH 501 


RESULT 6 
F31201 

GLI-related finger protein HKR4 - human (fragments) 
C; Species: Homo sapiens (man) 

C;Date: 31-Mar-1990 #sequence_revision 31-Mar~1990 #text change 17-Mar-1994 
C;Accession: F31201 ~~ 

R;Ruppert, J.M.; Kinzler, K.W. ; Wong, A. J.; Bigner, S.H.; Kao, F.T.; Law, M.L.; 

Seuanez, H.N. ; O'Brien, S.J.; Vogelstein, B. 

Mol. Cell. Biol. 8, 3104-3113, 1988 

A; Title: The GLI-Kruppel family of human genes. 

A; Reference number: A93103; MUID : 89096896; PMID:2850480 

A; Accession: F31201 

A;Molecule type: DNA 

A; Residues: 1-223 <RUP> 

C; Keywords: DNA binding; zinc finger 

Query Match 15.7%; Score 138.5; DB 2; Length 223; 

Best Local Similarity 30.7%; Pred. No. 5.6e-06; 

Matches 42; Conservative 13; Mismatches 39; Indels 43; Gaps 7; 

Q y 50 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCSSHFPYSQYQFW 109 

' : ' ' ' ' : : M 1:1 : MM I I I: || : ||: | 
Db 96 YECGQCGRA-FSHSSHFTQHLRIH-NGEKPYKCGECVRHQR — LH TGEKPYACSQCG 14 8 

Qy 110 KNF QTLKIHQRVHGGGGSYK CGLCQEFFG 138 

I Ml I : 111:11 

Db 149 KAFIWSSVLIEHQRIHTGEKPYECSDCGKAFRGRSHFFRHLRTHTGEKPFACGACGKAFG 208 

QY 139 LNNCSS SNRLDGHQRVH 155 

I'M Mill 
Db 209 QSSQLIQHQRVH 220 


RESULT 7 
138599 

zinc finger protein ZNF134 - human 
C; Species: Homo sapiens (man) 

C;Date: 23-Feb-1996 #sequence_revision 23-Feb-1996 #text change 05-Nov-1999 
C;Accession: 138599 ~ 
R;Tommerup, N.; Vissing, H. 
Genomics 27, 259-264, 1995 

A; Title: Isolation and fine mapping of 16 novel human zinc finger-encoding cDNAs 

identify putative candidate genes for developmental and malignant disorders. 

A; Reference number: A57785; MUID: 96044430; PMID: 7557990 

A; Accession: 138 599 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-348 <RES> 

A;Cross-references: EMBL:U09412; NID:g488552; PIDN : AAC50253 . 1 ; PID:g488553 

C; Genetics: 

A; Gene: GDB : ZNF134 

A;Cross-references : GDB: 137033 


A; Map position: 19ql3 . 4-19ql3 . 4 


Qy 

Db 


Query Match 15.4%; Score 136.5; DB 2; Length 348; 

Best Local Similarity 27.3%; Pred. No. 1.3e-05; 

Matches 45; Conservative 13; Mismatches 46; Indels 61; Gaps 9; 

50 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLC TRSQKEGLH YT 96 

III I I : :|: IIICI |:| | | | : :| | 

97 YKCSECGKA-FSRKDTLVQHQRIHS-GEKPYECSECGKAFSRKATLVQHQRIHTGERPYE 154 

Qy 97 CS SHFPYSQYQFWKNF QTLKIHQRVHGGGGSYKCGLC 133 

Db 155 CSECGKTFSRKDNLTQHKRIHTGF^PYKCNECGKYFSHHSNLI^QR^NGARPYKCSDC 214 

QY 134 QEFF GLN NCSSSNRLDG HQRVH 155 

! 1 I I : I I : I I I I : I 

Db 215 GKVFRHKSTLVQHESIHTGENPYDCSDCGKSFGHKYTLIKHQRIH 259 


RESULT 8 
T12527 

hypothetical protein DKFZp434N043 . 1 - human (fragment) 
C; Species: Homo sapiens (man) 

C^ssS^^ ^^-^sion 23- Ju l-1999 #text_change 01-Dec-2000 

R;Wambutt, R. ; Heubner, D.; Mewes, H.W.; Gassenhuber, J.; Wiemann, S. 

submitted to the Protein Sequence Database, June 1999 

A;Reference number: Z17524 

A;Accession: T12527 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 1-38 6 <WAM> 

A;Cross-references : EMBL : AL080143 

A; Experimental source: adult testis; clone DKFZp434N043 
C; Genetics: 

A;Note: DKFZp434N043 . 1 

C; Super family: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 15.4%; Score 136.5; DB 2; Length 386; 

Best Local Similarity 20.2%; Pred. No. 1.5e-05- 

Matches 47; Conservative 22; Mismatches 73; Indels 91; Gaps 7; 

Q y 6 AAL S C LMLVT AL G SMD YQ VS SPIYDINYYTSEPC QKINVKO 4 6 

=11 II = II I I I || . 

43 STLKCHESVHAREKQGFFVSGKILDQNPEQKEKCFKCNKCEKTFSCSKYLTQHERIHTRG 102 

Qy 47 IAAYKCGLCAAAQWDFGNT— MCQHQRVHGH HHH 78 

: : : I I I I I = = I I I : I , 
103 VKP FECDQCGKA FGQSTRLI HHQRI HS RVRLYKWGEQGKAI S SAS LI KLQS FHTKEH 159 

Q y 79 SYKCGLCTRSQKEGLHYT — CSSHFPYSQY 106 

: I I I : : I : | | | 

Db 160 PFKCNECGKTFSHSAHLSKHQLIHAGENPFKCSKCDRVFTQRNYLVQHERTHARKKPLVC 219 

Qy 107 QFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 155 

: : : I I I I : I I II I : M I I I I I I • I 

Db 220 NECGKTFRQSSCLSKHQRIHSGEKPYVCDYCGKAFGL SAELVRHQRIH 267 


Db 


Db 


RESULT 9 
139311 

Kruppel-type zinc finger protein ZNF74 - human 
C; Species: Homo sapiens (man) 

C;Date: 06-Sep-1996 #sequence_revision 06-Sep-1996 #text change Ol-Dec-2000 
C;Accession: 139311; F42825; S37482 

R;Aubry, M. ; Demczuk, S.; Desmaze, C; Aikem, M. ; Aurias, A.; Julien, J P • 
Rouleau, G.A. * 
Hum. Mol. Genet. 2, 1583-1587, 1993 

A;Title: Isolation of a zinc finger gene consistently deleted in DiGeorge 
syndrome. 

A; Reference number: 139311; MUID : 94 093543 ; PMID: 8268910 
A;Accession: 139311 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-572 <RES> 

A; Cross-references: EMBL:X71623; NID : gl050529 ; PIDN : CAA50632 . 1 ; PID:gl050530 
A; Note: submitted to the EMBL Data Library, April 1993, revised 01-NOV-1995 
R;Aubry, M. ; Marineau, C. ; Zhang, F.R.; Zahed, L. ; Figlewicz, D. ; Delattre, O.; 
Thomas, G . ; de Jong, P.J.; Julien, J. P.; Rouleau, G.A. 
Genomics 13, 641-648, 1992 

A; Title: Cloning of six new genes with zinc finger motifs mapping to short and 
long arms of human acrocentric chromosome 22 (p and qll.2). 
A; Reference number: A42825; MUID : 92347859 ; PMID: 1639391 
A; Accession: F42 82 5 

A; Status: preliminary; not compared with conceptual translation 

A; Molecule type: nucleic acid 

A; Residues: 427-444 , 'V , 4 46-511 <AUB> 

A; Note: sequence extracted from NCBI backbone (NCBIP : 109775 ) 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 15.3%; Score 135.5; DB 2; Length 572; 

Best Local Similarity 29.2%; Pred. No. 2.7e-05; 

Matches 33; Conservative 14; Mismatches 25; Indels 41; Gaps 5; 

QY 50 YKCGLCAAAQWDFGNTMCQHQRVHGHHHH SYKCGLCTRSQKEGLHYTCSSHFP 102 

1 : I I I I : I I I || |:||| | : | | | 
Db 288 YRCGECGKA FNQRTHLTRHHRIHTGEKPYQCGSC GKAFTCHS 32 9 

®y 103 YSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 155 

:| :|:::| I : I I | : : | :| :|| III | 

Db 330 SLTVHEKIHSGDKPFKCSDCEKAF NSRSRLTLHQRTH 366 

RESULT 10 
148668 

zinc finger protein 51 - mouse 
N;Alternate names: finger protein zfecl2 
C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text change 08-Dec-2000 
C;Accession: 148668; A40984 ~ 
R;Burke, P.S.; Don, J.; Wolgemuth, D.J. 
Mamm. Genome 5, 387-389, 1994 


A; Title: Zfp-51, a murine zinc finger encoding gene mapping to the t-complex 
region of chromosome 17, encodes 19 contiguous zinc fingers and is ubiquitously 
expressed. ^ J 

A; Reference number: 148668; MUID: 94319090; PMID: 8043957 
A; Accession: 14 8 668 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-710 <RES> 

A;Cross-references: EMBL:X74855; NID:g488832; PIDN: CAA52847 . 1; PID:g488833 
R;Crossley, P.H.; Little, P.F.R. 

Proc. Natl. Acad. Sci. U.S.A. 88, 7923-7927, 1991 

A;Title: A cluster of related zinc finger protein genes is deleted in the mouse 
embryonic lethal mutation t(wl8). 

A; Reference number: A40984; MUID : 91376058 ; PMID: 1680234 

A; Accession: A4 0984 

A; Status: preliminary 

A;Molecule type: mRNA 

A; Residues: 672-710 <CRO> 

A;Cross-references : GB:M74235 

C; Genetics : 

A; Gene: Zfp-51 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 
C; Keywords: DNA binding; zinc finger 

Query Match 15.2%; Score 134; DB 2; Length 710; 

Best Local Similarity 28.6%; Pred. No. 4.7e-05; 

Matches 38; Conservative 20; Mismatches 39; Indels 36; Gaps 7; 

Q y 28 IYDINYYTS EPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKC 82 

: i I : I : : I : I | | | | | | s : s : : I |:| | | | 

Db 120 VYD-DYFSSAYSLMQPTAYIKEKPHQCGKCGKC FNTSSSLTVHHRIH-TKEKPYKC 173 

Q y 83 GLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGLNNC 142 

,_ A : ' : ' : I : I I I I I I I I I I I | | :: | 
Db 174 SVCDKS FTQCTH LKIHQRRHTGEKPYKCKECEKSF 208 

Qy 143 SSSNRLDGHQRVH 155 

: I I I : : I 
Db 209 VQLSALKSHQKLH 221 


RESULT 11 
S03679 

finger protein (clone mkr5) - mouse (fragment) 
C; Species: Mus musculus (house mouse) 

C;Date: 04-Dec-1992 #sequence_revision 04-Dec-1992 #text change Ol-Dec-2000 
C;Accession: S03679 ™ 
R;Chowdhury, K. ; Rohdewohld, H. ; Gruss, P. 
Nucleic Acids Res. 16, 9995-10011, 1988 

A; Title: Specific and ubiquitous expression of different Zn finger protein genes 
in the mouse. 

A;Reference number: S03677; MUID: 89057528 ; PMID:3143103 
A; Access ion: S 03 679 

A; Status: not compared with conceptual translation 
A;Molecule type: mRNA 
A; Residues: 1-475 <CHO> 


A;Note: the sequence in fig. 2 is inconsistent with that" shown in fig.l in having 
64-Asn, 159-Trp, 187-trp, 188-trp, and in lacking a Met after 64-Gly and an Asp 
after 78-Ser 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 15.1%; Score 133.5; DB 2; Length 475; 

Best Local Similarity 27.8%; Pred. No. 3.5e-05; 

Matches 40; Conservative 24; Mismatches 49; Indels 31; Gaps 8; 

Q y 34 YTSEPCQK INVKQ IAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYK 81 

II I M : I I | : | : | | : : : | | | | | | - | 

Db 223 YTCEVCHKSFRYGSSLTVHQRIHTGEKPYECEICRKA-FSHHASLTQHQRVHS-GEKPFK 280 

Qy 82 CGLCTRSQKEGLHYTCSSHF PYSQYQFWKNF QTLKIHQRVHGGGGSYKCG 131 

I I : : : : : I : | | : | : s | : | | | | | : | | : - | 

Db 281 GKECGKAFRQNIH LASHWRIHTGEKPFECGECGKSFSISSQLATHQRIHTGEKPFECK 338 

QY 132 LCQEFFGLNNCSSSNRLDGHQRVH 155 
D *> 339 VCRKAFRQN IHLASHWRIH 357 


RESULT 12 
148689 

gene NK10 protein - mouse 

C; Species: Mus musculus (house mouse) 

C;Date: 02-Jul-1996 #sequence_revision 02-Jul-1996 #text change Ol-Dec-2000 
C;Accession: 148689; S49078 ~ 

R;Lange, R. ; Christoph, A. ; Thiesen, H.J.; Vopper, G. ; Johnson, K.R.; Lemaire, 
L.; Plomann, M. ; Cremer, H. ; Barthels, D. ; Heinleim, U.A.O. 
DNA Cell Biol. 14, 971-981, 1995 

A;Title: Developmentally regulated mouse gene NK10 eucodes a xZInc Finger 
Repressor Protein with differential NDA-Binding Domains. 
A; Reference number: 148689; MUID: 96069544 ; PMID: 7576184 
A; Accession: 148689 

A; Status: preliminary; translated from GB/EMBL/DDB J 
A; Molecule type: mRNA 
A; Residues: 1-636 <RES> 

A;Cross-references: EMBL:X79828; NID:g506501; PIDN : CAA56225 . 1 ; PID:g506502 
C; Super family: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 15.1%; Score 133.5; DB 2; Length 636; 

Best Local Similarity 31.6%; Pred. No. 4.7e-05; 

Matches 37; Conservative 15; Mismatches 44; Indels 21; Gaps 6; 

Q y 50 YKCGLCAAAQWDFGNTMCQHQRVHGHHHHSYKCGLCTRS QKEGLHYTCSSHF 101 

Mill: : : : I I I I : I hi II II III | 
Db 306 YQCSLCGKA- FQRSSSLVQHQRIH-TGEKPYRCNLCGRSFRHSTSLTQHEVTH SGEK 360 

QY 102 PYSQYQFWKNF— QTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 155 

Is : I I. : I I : I I I : : I : | | | I I I I : I 

Db 361 PFQCKECGKAFSRCSSLVQHERTHTGEKPFECSICGRAFG QSPSLYKHMRIH 412 


RESULT 13 
T14757 

hypothetical protein DKFZp572C163 . 1 - human (fragment) 


C; Species: Homo sapiens (man) 

C;Date: 20-Sep-1999 #sequence_revision 20~Sep-1999 #text change Ol-Dec-2000 

C;Accession: T14757 ~~ 

R;Wambutt, R. ; Heubner, D. ; Mewes, H.W.; Gassenhuber, J.; Wiemann, S. 

submitted to the Protein Sequence Database, August 1999 

A;Reference number: Z18181 

A; Accession: T14 757 

A; Status : preliminary 

A;Molecule type: mRNA 

A; Residues: 1-701 <WAM> 

A;Cross-references : EMBL: AL110217 

A; Experimental source: adult subthalamic nucleus; clone DKFZp572C163 
C; Genetics: 

A;Note: DKFZp572C163 . 1 

C;Superfamily: zinc finger protein ZFP-36; LIM metal-binding repeat homology 

Query Match 15.1%; Score 133.5; DB 2; Length 701; 

Best Local Similarity 23.4%; Pred. No. 5.2e-05; 

Matches 41; Conservative 16; Mismatches 37; Indels 81; Gaps 8; 

^ 50 YKCGLCAAAQWDFGNT MCQHQRVH GHHH-HS- 7 9 

I'll M : I I I I : I Ml: 

Db 468 YECNQC GKTFSQRTHLCAHQRIHTGEKPYECNECGKTFADNSALRAHHRIHTG 520 

2^ 80 YKCGLCTRSQKEGLHY TCSSHFPYSQYQFWKNF 112 

I : I I : : : I | | | | : | | 

521 EKPYECNDCGKTFSKTSHLRAHLRTRSGEKPYECSECGKTFSEKSYVSAHQRVHTGEKPY 580 

Q y 113 QTLKIHQRVHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVH 155 

M : : I I I : I I I I : I I : I | : | | | | : | 

Db 581 ECNVCGKPFAHNSTLRVHQRIHTGEKSYECNDCGKTF SQKSHLSAHQRIH 630 


Db 


RESULT 14 
A34612 

zinc finger protein ZNF7 - human 
N;Alternate names: zinc finger protein kox4 
C; Species: Homo sapiens (man) 

C;Date: 22-Jun-1990 #sequence_revision 22-Jun-1990 #text change Ol-Dec-2000 
C;Accession: A34612; A56409; S10421; 137972 

R;Lania, L.; Donti, E. ; Pannuti, A.; Pascucci, A.; Pengue, G. ; Feliciello, I.; 
La Mantia, G. ; Lanfrancone, L.; Pelicci, P.G. 
Genomics 6, 333-340, 1990 

A;Title: cDNA isolation, expression analysis, and chromosomal localization of 
two human zinc finger genes. 

A;Reference number: A34612; MUID: 90169993 ; PMID:2106481 
A;Accession: A34612 
A; Molecule type: mRNA 
A; Residues: 1-686 <LAN> 

A;Cross-references: GB:M29580; NID:g340445; PIDN : AAA61313 . 1 ; PID-a340446- 
GB:J04751 

R;Bray, P.; Lichter, P.; Thiesen, H.J.; Ward, D.C.; Dawid, I.B. 
Proc. Natl. Acad. Sci. U.S.A. 88, 9563-9567, 1991 

A;Title: Characterization and mapping of human genes encoding zinc finger 
proteins. 

A; Reference number: A56409; MUID: 92052132 ; PMID: 1946370 
A; Accession: A564 09 


A;Molecule type: DNA 

A; Residues: 425-589 <BRA> 

A; Cross-references : GB:M7717 0 

R;Thiesen, H.J. 

submitted to the EMBL Data Library, March 1990 

A;Reference number: S10397 

A; Accession: S10421 

A; Molecule type: mRNA 

A; Residues: 413-468 <THI> 

A; Cross-references: EMBL:X52335; NID:g34165; PIDN : CAA36561 . 1 ; PID:a930095 

R;Thiesen, H.J. 

New Biol. 2, 363-374, 1990 

A; Title: Multiple genes encoding zinc finger domains are expressed in human T 
cej_ _i_ s • 

A;Reference number: 137949; MUID: 91145339; PMID:2288909 
A; Accession: 137 972 

A; Status: translated from GB/EMBL/DDB J 

A; Molecule type: mRNA 

A; Residues: 413-468 <RES> 

A;Cross-references: EMBL:X52335; NID: g 34165; PIDN : CAA3 6561 . 1 ; PID:a930095 
C; Genetics: ' 
A; Gene : GDB : ZNF7 

A;Cross-references : GDB: 120509; OMIM: 194531 
A; Map position: 8q24 . 3-8q24 . 3 

C; Super family: zinc finger protein ZFP-36; LIM metal-binding repeat homology 
C;Keywords: DNA binding; zinc finger 

Query Match 14.9%; Score 131.5; DB 2; Length 686; 

Best Local Similarity 26.9%; Pred. No. 7.9e-05; 

Matches 45; Conservative 11; Mismatches 46; Indels 65; Gaps 8; 

Q y 50 YKCGLCAAAQWDFG NTMCQHQRVHGHHHHSYKCGLCTRSQKEGLH 94 

Ml I I II : : :||| | ^ 


Db 


441 YKCNKCTKA— FGCSSRLIRHQRTH-TGEKPFKCDECGKGFVQGSHLIQHQRIHTGEKP 496 


Qy 95 YTCS- 


Db 


SHFPYSQYQFWKNFQ TLKIHQRVHGGGGSYKCG 131 

11 1 N III I I I I! I I I Ml 

497 YVCNDCGKAFSQS SS LI YHQRIHKGEKPYECLQCGKAFSMSTQLTIHQRVHTGERPYKCN 556 


QY 132 LCQEFFGLNN CSS SNRLDGHQRVH 155 

I : I I : II | : | | | | : | 

557 ECGKAFSQNSTLFQHQIIHAGVKPYECSECGKAFSRSSYLIEHQRIH 603 


Db 


RESULT 15 
S70006 

finger protein zfOCl - guinea pig 

C; Species: Cavia porcellus (guinea pig) 

C;Date: 06-Dec-1996 #sequence_revision 13-Mar-1997 #text change 05-Nov-1999 
C;Accession: S70006 ~ 
R;Rivolta, M.N.; Negrini, C; Wilcox, E.R. 
Biochim. Biophys . Acta 1306, 127-132, 1996 

A;Title: A novel zinc finger gene preferentially expressed in the retina and the 
organ of Corti localizes to human chromosome 12q24.3. 
A;Reference number: S70006; MUID : 96221281; PMID: 8634327 
A;Accession: S70006 
A; Status : preliminary 


A;Molecule type: mRNA 
A; Residues: 1-261 <RIV> 

A/Cross-references: EMBL:L26335; NID : gl237277 ; PIDN: AAC42091 . 1 ; PID-al237278 
C; Keywords: zinc " y 

Query Match 14.8%; Score 131; DB 2; Length 261; 

Best Local Similarity 28.8%; Pred. No. 3.4e-05; 

Matches 42; Conservative 16; Mismatches 44; Indels 44; Gaps 7; 
QY 45 ^ IAA YKCG LCAAAQWDFGNT— MCQHQRVHGHHHHSY 8 0 
Db 85 KPYKCYECGKAFNWSPHLQIHMRVHTGEKPYVCSECGRGFSNSSNLCMHQRVH-TGEKPF 143 
QY 81 KCGLCTRS QKEGLHYTCSSHFPYSQYQFWKNF QTLKIHQRVHGGGGSYK 129 


Db 


Ml:: : : | - || I : II 


144 KCEECGKAFRHTSSLCMHQRVH TGEKPYKCYECG^FSQSSSLCIHQRv^ 200 


130 CGLCQEFFGLNNCSSSNRLDGHQRVH 155 
I 1:1 I I : I I I II I 

Db 201 CCGCGKAF SQSSSLCIHQRVH 221 


Search completed: September 17, 2004, 16:49:57 
Job time : 26.5458 sees 


GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 
Run on: 


Title: 

Perfect score: 
Sequence : 

Scoring table: 


Searched: 


September 17, 2004, 16:48:23 ; Search time 95.2644 Seconds 

(without alignments) 
529.242 Million cell updates/sec 

US-10-057-890A-31 
884 

1 MKVSVAALSCLMLVTALGSM GLNNCSSSNRLDGHQRVHAA 157 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 

1342398 seqs, 321133274 residues 


Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 


1342398 


Database 


1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 


Published_Applications AA: 


/ cgn2_6/ptodata/2/pubpaa/US 07_PUBCOMB . pep : * 
/ cgn2_6/ptodata/2 /pubpaa/ PCT_NEW_PUB . pep : * 
/ cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep: * 
/ cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep: * 
/ cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB . pep : * 
/cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep: * 
/ cgn2_6/ ptodata/2/pubpaa/US 08_NEW_PUB . pep : * 
/ cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB . pep : * 
/cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep: * 
/cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep: 
/ cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: 
/ cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB. pep : * 
/ cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep: 
/cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep: 
/cgn2__6/ptodata/2/pubpaa/US10C_PUBCOMB.pep: 
/ cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep: * 
/ cgn2_6/ptodata/2/pubpaa/US60_NEW~PUB.pep : * 
/cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep: * 


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 


RESULT 1 

US-10-057-890A-31 

; Sequence 31, Application US/10057890A 

; Publication No. US20030044 901A1 

; GENERAL INFORMATION: 

; APPLICANT: Coleman, Timothy 


; APPLICANT: Mansfield, Brian 

; TITLE OF INVENTION: Scaffold Fusion Polypeptides, Composition for Making the 
Same, and Methods 

; TITLE OF INVENTION: of Using the Same. 

FILE REFERENCE: PF537 
; CURRENT APPLICATION NUMBER: US/ 1 0/ 057 , 8 90A 
; CURRENT FILING DATE: 2002-01-29 
; PRIOR APPLICATION NUMBER: 60/265,782 
; PRIOR FILING DATE: 2001-01-31 
; PRIOR APPLICATION NUMBER: 60/265,858 
; PRIOR FILING DATE: 2001-01-31 
; NUMBER OF SEQ ID NOS: 32 
; SEQ ID NO 31 

LENGTH: 157 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-057-890A-31 


Query Match 100.0%; Score 884; DB 14; Length 157; 

Best Local Similarity 100.0%; Pred. No. 1.3e-79; 

Matches 157; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Q y 1 MKVSVAALSCLMLVTALGSMDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQW 60 

I I I I M I I I I I II I I I | M I I I I I t I I I I I I I I I I I I II I I I I I I I I I I I 

MKVSVAALSCLML VTALGSMD YQVS S PI YDIN YYT SEPCQKINVKQI AAYKCGLCAAAQW 60 


Db 1 


Q y 61 DFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQR 12 0 

HI' M , I II I I I I | I | l| I IN I II I I I I ! I I | , I I I I I I | | M I I 

Db 61 DFGNTMCQHQRVHGHHHHSYKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQR 12 0 

Qy 121 VHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVHAA 157 

I I M M I I I I I I I I I I I I I I | | | | | | | | | | || | | | | | 

Db 121 VHGGGGSYKCGLCQEFFGLNNCSSSNRLDGHQRVHAA 157 

RESULT 2 

US-10-057-890A-10 

Sequence 10, Application US/10057890A 
Publication No. US20030044901A1 
GENERAL INFORMATION: 
APPLICANT: Coleman, Timothy 
APPLICANT: Mansfield, Brian 

TITLE OF INVENTION: Scaffold Fusion Polypeptides, Composition for Making the 
Same, and Methods 

TITLE OF INVENTION: of Using the Same. 
FILE REFERENCE: PF537 

CURRENT APPLICATION NUMBER: US/10/057, 890A 
CURRENT FILING DATE: 2002-01-29 
PRIOR APPLICATION NUMBER: 60/265,782 
PRIOR FILING DATE: 2001-01-31 
PRIOR APPLICATION NUMBER: 60/265,858 
PRIOR FILING DATE: 2001-01-31 
NUMBER OF SEQ ID NOS: 32 
SEQ ID NO 10 
LENGTH: 138 
TYPE: PRT 

ORGANISM: Homo sapiens 


US-10-057-890A-10 


Query Match 90.2%; Score 797; DB 14; Length 138; 

Best Local Similarity 100.0%; Pred. No. 4.6e-71; 

Matches 138; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 79 

I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAAYKCGLCAAAQWDFGNTMCQHQRVHGHHHHS 60 

Qy 80 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 139 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 YKCGLCTRSQKEGLHYTCSSHFPYSQYQFWKNFQTLKIHQRVHGGGGSYKCGLCQEFFGL 12 0 

Qy 140 NNCSSSNRLDGHQRVHAA 157 

I II I II I I I I I I I I I I I I 
Db 121 NNCSSSNRLDGHQRVHAA 138 


RESULT 3 
US-09-725-285-2 

; Sequence 2, Application US/09725285 

; Patent No. US2 0010000241A1 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven, M. 

; TITLE OF INVENTION: Antibodies to Human G-Protein Chemokine Receptor 
HDGNR10 

; TITLE OF INVENTION: (CCR5 Receptor) 

; FILE REFERENCE: 1488.1150003 

; CURRENT APPLICATION NUMBER: US/09/725 , 285 

; CURRENT FILING DATE: 2000-11-29 

; PRIOR APPLICATION NUMBER: 09/339,912 

; PRIOR FILING DATE: 1999-06-25 

; PRIOR APPLICATION NUMBER: 09/195,662 

; PRIOR FILING DATE: 1998-11-18 

; PRIOR APPLICATION NUMBER: 08/466,343 

; PRIOR FILING DATE: 1995-06-06 

; NUMBER OF SEQ ID NOS : 9 

; SOFTWARE: Patentln version 3.0 

; SEQ ID NO 2 

; LENGTH: 352 

; TYPE: PRT 

; ORGANISM: Artificial Sequence: Genomic 
FEATURE: 

; OTHER INFORMATION: Deduced Amino Acid Sequence 
US-09-725-285-2 


Query Match 42.9%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 


50 


YKCGLC 


•AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 


Db 


61 


LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 


84 


LCTRSQKEGLHYTC 97 



Db 


119 


IILLTIDRYLAWHAVFALKARTVTFGWTSVITWWAVFASLPGIIFTRSQKEGLHYTC 17 8 


Qy 


98 



Db 


179 


Qy 


125 


GGSYKCGLC QEFFGLNNCS S SNRLDGHQRV 154 



Db 


239 


TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


RESULT 4 
US-09-759-841-2 

; Sequence 2, Application US/09759841 

; Patent No. US2 0010039026A1 

; GENERAL INFORMATION: 

; APPLICANT: Rickett, Graham A 

; APPLICANT: Dobbs, Susan 

APPLICANT: Perros, Manoussos 
; TITLE OF INVENTION: Assay Method 
; FILE REFERENCE: PC10348APME 
; CURRENT APPLICATION NUMBER: US/ 09/759, 841 
; CURRENT FILING DATE: 2001-01-12 
; PRIOR APPLICATION NUMBER: GB 0000661.9 

PRIOR FILING DATE: 2000-01-12 
; PRIOR APPLICATION NUMBER: GB 0000663.5 

PRIOR FILING DATE: 2000-01-12 
; PRIOR APPLICATION NUMBER: GB 0000659.3 
; PRIOR FILING DATE: 2000-01-12 
; NUMBER OF SEQ ID NOS : 6 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 2 

LENGTH: 352 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-759-841-2 

Query Match 42.9%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I II I I I I I I I I I ! I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I II I I I I I I I : I : : I 

Db 61 LKSMTDI YLLNLAI SDLFFLLTVP FWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 

Db 


84 
119 


LCTRSQKEGLHYTC 97 

: I I I I I I I I II I I 

IILLTIDRYLAWHAVFALKARTVTFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 


Qy 


98 



Db 


179 


Qy 


125 


GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 



Db 


239 


TIMI VYFLFWAPYNIVLLLNTFQEFFGLNNCS S SNRLDQAMQV 281 


RESULT 5 

US-09-779-879A-22 

; Sequence 22, Application US/09779879A 

; Patent No. US20020048786A1 

; GENERAL INFORMATION: 

; APPLICANT: Rosen, Craig A. 

; APPLICANT: Roschke, Viktor 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven, M. 

; TITLE OF INVENTION: Human G-protein Chemokine Receptor (CCR5) HDGNR10 

; FILE REFERENCE: 1488.115000A 

; CURRENT APPLICATION NUMBER: US/ 09/779, 879A 

; CURRENT FILING DATE: 2001-02-09 

; PRIOR APPLICATION NUMBER: US 60/181,258 

; PRIOR FILING DATE: 2000-02-09 

; PRIOR APPLICATION NUMBER: US 60/187,999 

; PRIOR FILING DATE: 2000-03-09 

; PRIOR APPLICATION NUMBER: US 60/234,336 

; PRIOR FILING DATE: 2000-09-22 

; NUMBER OF SEQ ID NOS : 58 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 22 
LENGTH: 352 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-779-879A-22 

Query Match 42.9%; Score 37 9; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I II I I I I I I I I I I II I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 8 4 LCTRSQKEGLHYTC 97 

: I I I I I I II I I I I 

Db 119 IILLTIDRYIAWHAVFALKARTWFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I II I I I I I I I I I I : : I 

Db 17 9 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNT FQEFFGLNNCS S SNRLDQAMQV 281 


RESULT 6 

US-09-779-880A-22 

Sequence 22, Application US/09779880A 
Patent No. US20020061834A1 
GENERAL INFORMATION: 
APPLICANT: Rosen, Craig A. 
APPLICANT: Roschke, Viktor 
APPLICANT: Li, Yi 
APPLICANT: Ruben, Steven, M. 

TITLE OF INVENTION: Human G-protein Chemokine Receptor (CCR5) KDGNR10 
FILE REFERENCE: 1488.115000C 

CURRENT APPLICATION NUMBER: US/09/779, 880A 
CURRENT FILING DATE: 2001-02-09 
PRIOR APPLICATION NUMBER: US 60/181,258 
PRIOR FILING DATE: 2000-02-09 
PRIOR APPLICATION NUMBER: US 60/187,999 
PRIOR FILING DATE: 2000-03-09 
PRIOR APPLICATION NUMBER: US 60/234,336 
PRIOR FILING DATE: 2000-09-22 
NUMBER OF SEQ ID NOS : 58 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 22 
LENGTH: 352 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-779-880A-22 

Query Match 42.9%; Score 37 9; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I II I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I 1 I I I I I I I I I I 

Db 119 IILLTIDRYLAWHAVFALKARTVTFGVAn?SVITWVVAVFASLPGIIFTRSQKEGLHYTC 17 8 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I II I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNT FQEFFGLNNCS S SNRLDQAMQV 281 


RESULT 7 

US-09-813-653-15 

; Sequence 15, Application US/09813653 

; Patent No. US2 002 006477 OAl 

; GENERAL INFORMATION: 

; APPLICANT: Nestor, John 

; APPLICANT: Wilson, Carol 

; APPLICANT: See, Raymond 

; APPLICANT: Tan Hehir, Christina 

; TITLE OF INVENTION: Binding Compounds and Methods For Identifying Binding 
Compounds 

; FILE REFERENCE: CNS-005 

; CURRENT APPLICATION NUMBER: US/09/ 813, 653 
; CURRENT FILING DATE: 2001-03-20 

PRIOR APPLICATION NUMBER: US 60/190,946 
; PRIOR FILING DATE: 2000-03-21 
; PRIOR APPLICATION NUMBER: US 60/190,996 
; PRIOR FILING DATE: 2000-03-21 
; PRIOR APPLICATION NUMBER: US 60/191,299 
; PRIOR FILING DATE: 2000-03-21 
; NUMBER OF SEQ ID NOS : 44 
; SOFTWARE: Patentln version 3.0 
; SEQ ID NO 15 
; LENGTH: 352 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-09-813-653-15 

Query Match 42.9%; Score 37 9; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I I II I I I I I I I I I I I I I I I II I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 5 0 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I II I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYIAVVHAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 17 8 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 17 9 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 8 

US-09-813-653-17 

; Sequence 17, Application US/09813653 
; Patent No. US20020064770A1 


; GENERAL INFORMATION: 

; APPLICANT: Nestor, John 

; APPLICANT: Wilson, Carol 

; APPLICANT: See, Raymond 

; APPLICANT: Tan Hehir, Christina 

; TITLE OF INVENTION: Binding Compounds and Methods For Identifying Binding 
Compounds 

; FILE REFERENCE: CNS-005 

; CURRENT APPLICATION NUMBER: US/09/813, 653 

; CURRENT FILING DATE: 2001-03-20 

; PRIOR APPLICATION NUMBER: US 60/190,946 

; PRIOR FILING DATE: 2000-03-21 

; PRIOR APPLICATION NUMBER: US 60/190,996 

; PRIOR FILING DATE: 2000-03-21 

; PRIOR APPLICATION NUMBER: US 60/191,299 

; PRIOR FILING DATE: 2000-03-21 

; NUMBER OF SEQ ID NOS : 44 

; SOFTWARE: Patentln version 3.0 

; SEQ ID NO 17 

LENGTH: 352 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-813-653-17 

Query Match 42.9%; Score 37 9; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MD YQVS S P I YDINYYTSEPCQKINVKQIAA 49 

I I I I I I II I I I I I I I I I I I I I I II I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILIQINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I II I I I I I I I II 

Db 119 1 1 LLT I DRYLAVVHAVFALKART VT FGWT S VI TWWAVFAS LP GI I FTRSQKEGLH YT C 17 8 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I II I I I I II I I I I I I I I I : : I 

Db 17 9 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I II I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 9 
US-09-796-202-1 

; Sequence 1, Application US/09796202 

; Patent No. US20020068813A1 

; GENERAL INFORMATION: 

; APPLICANT: Dragic, Tatjana 

; APPLICANT: Olson, William 

; TITLE OF INVENTION: SULFATED CCR5 PEPTIDES FOR HIV-1 INFECTION 


FILE REFERENCE: 2048/ 61010/ JPW/SHS 
CURRENT APPLICATION NUMBER: US/09/796,202 
CURRENT FILING DATE: 2001-02-28 
NUMBER OF SEQ ID NOS : 17 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 1 
LENGTH: 352 
TYPE: PRT 
ORGANISM: human 
US-09-796-202-1 

Query Match 42.9%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I II I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 8 4 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 1 1 LLT I DRYLAVVHAVFALKARTVT FGWTSVITWWAVFAS LPGI I FTRSQKEGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I II : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 10 
US-09-195-662A-2 

; Sequence 2, Application US/09195662A 

; Patent No. US2002 0076745A1 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven, M. 

; TITLE OF INVENTION: Human G-Protein Chemokine Receptor HDGNR10 (CCR5 
Receptor) 

; FILE REFERENCE: 1488.1150002 

; CURRENT APPLICATION NUMBER: US/ 09/195, 662A 

; CURRENT FILING DATE: 1998-11-18 

; PRIOR APPLICATION NUMBER: 08/466,343 

; PRIOR FILING DATE: 1995-06-06 

; NUMBER OF SEQ ID NOS: 9 

; SOFTWARE: Patentln version 3.0 

; SEQ ID NO 2 

; LENGTH: 352 

; TYPE: PRT 

; ORGANISM: Artificial Sequence: Genomic 
FEATURE: 


OTHER INFORMATION: Deduced Amino Acid Sequence 
US-09-195-662A-2 

Query Match 42.9%; Score 37 9; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I I I II II I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I II 

Db 119 IILLTIDRYIAWHAVFALKARTWFGV^SVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I II I II I I I I I I II I I I I I I :: I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 11 
US-09-339-912A-2 

; Sequence 2, Application US/09339912A 

; Patent No. US20020099176A1 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven, M. 

; TITLE OF INVENTION: Antibodies to Human G-Protein Chemokine Receptor 
HDGNR10 

; TITLE OF INVENTION: (CCR5 Receptor) 

; FILE REFERENCE: 1488.1150003 

; CURRENT APPLICATION NUMBER: US/09/339 , 912A 

; CURRENT FILING DATE: 1999-06-25 

; PRIOR APPLICATION NUMBER: 09/195,662 

; PRIOR FILING DATE: 1998-11-18 

; PRIOR APPLICATION NUMBER: 08/466,343 

; PRIOR FILING DATE: 1995-06-06 

; NUMBER OF SEQ ID NOS : 9 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 2 

LENGTH: 352 

TYPE: PRT 

; ORGANISM: Artificial Sequence: Genomic 
FEATURE: 

; OTHER INFORMATION: Deduced Amino Acid Sequence 
US-09-339-912A-2 


Query Match 42.9%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 


Matches 99; Conservative 


7; Mismatches 27; Indels 150; Gaps 


7; 


Db 


Qy 



Db 


Qy 


50 YKCGLC — AAAQWDFGNTMCQHQRVHGHHHHSYKCG 8 3 

I I I I I I I I I I I I I I I : I : : I 

61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 


84 LCTRSQKEGLHYTC 97 


Db 


119 1 1 LLT I DRYLAWHAVFALKARTVT FGWT S VI TWWAVFAS L PGI I FT RSQKEGLH YTC 17 



Qy 


Db 



Qy 


125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 



Db 


239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


RESULT 12 
US-09-938-719-5 

; Sequence 5, Application US/09938719 
; Patent No. US2002 0106742A1 
GENERAL INFORMATION: 


TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 


NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 

STREET: 620 Newport Center Drive 16th Floor 

CITY: Newport Beach 

STATE: CA 

COUNTRY: U.S.A. 

ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/ 938 , 7 19 

FILING DATE: 24-Aug-2001 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/626,939 

FILING DATE: 27-JULY-2000 
ATTORNEY/AGENT INFORMATION: 

NAME: Altman, Daniel E 

REGISTRATION NUMBER: 34,115 


APPLICANT: SAMSON, MICHEL 


PARMENTIER, MARC 
VASSART, GILBERT 
LIBERT, FREDERICK 


AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 


REFERENCE/ DOCKET NUMBER: <Unknown> 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-938-719-5 

Query Match 42.9%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I I II I I I I I I I I I I II I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 5 0 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 8 3 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: MINIMUM 

Db 119 IILLTIDRYLAVAmAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 17 8 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I M I I M II I I M M I I I I :: I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I II M I I I II II II I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 13 
US-09-939-226-5 

; Sequence 5, Application US/09939226 
; Patent No. US2 00201108 05A1 
GENERAL INFORMATION: 

APPLICANT: SAMSON, MICHEL 
; PARMENTIER, MARC 

; VASSART, GILBERT 

LIBERT, FREDERICK 
TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 

AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES: 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 62 0 Newport Center Drive 16th Floor 
; CITY: Newport Beach 

STATE: CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 


OPERATING SYSTEM: PODOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/939,226 
FILING DATE: 24-Aug-2001 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/626,939 
FILING DATE: 2000-07-27 
ATTORNEY/AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/DOCKET NUMBER: <Unknown> 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-939-226-5 

Query Match 42.9%; Score 37 9; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I II I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYLAVVHAVFALKARTvTFGVVTSVI TWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I II I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 14 
US-09-938-703-5 

; Sequence 5, Application US/09938703 

; Patent No. US20020110870A1 

; GENERAL INFORMATION: 

; APPLICANT: SAMSON, MICHEL 

; PARMENTIER, MARC 

VASSART, GILBERT 

LIBERT, FREDERICK 

; TITLE OF INVENTION: ACTIVE AND INACTIVE CC-CHEMOKINES RECEPTOR 


AND NUCLEIC ACID MOLECULES ENCODING SAID RECEPTOR 
NUMBER OF SEQUENCES : 17 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Knobbe, Martens, Olson & Bear 
STREET: 620 Newport Center Drive 16th Floor 
CITY: Newport Beach 
STATE: CA 
COUNTRY: U.S.A. 
ZIP: 92660 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/ 938 , 7 03 
FILING DATE: 24-Aug-2001 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/626,939 
FILING DATE: 2000-07-27 
ATTORNEY/ AGENT INFORMATION: 
NAME: Altman, Daniel E 
REGISTRATION NUMBER: 34,115 
REFERENCE/DOCKET NUMBER: <Unknown> 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 352 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-938-703-5 

Query Match 42.9%; Score 37 9; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 

Qy 2 0 MD YQVS S P I YDIN YYT S EP CQKINVKQI AA 4 9 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I 1111111111111:1: : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 84 ■ LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 IILLTIDRYIAWHAVFALKARTVTFGVVTSVITWWAVFASLPGIIFTRSQKEGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I :: I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I II I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 15 
US-09-502-783A-2 

; Sequence 2, Application US/09502783A 

; Patent No. US2 00201322 69A1 

; GENERAL INFORMATION: 

; APPLICANT: Li, Yi 

; APPLICANT: Ruben, Steven M. 

; TITLE OF INVENTION: Polynucleotides Encoding Human G-Protein Chemokine 
Receptor (CCR5) 

; TITLE OF INVENTION: HDGNR10 
; FILE REFERENCE: 1488.1150006 

; CURRENT APPLICATION NUMBER: US/ 09/502 , 7 8 3A 

CURRENT FILING DATE: 2001-08-23 
; PRIOR APPLICATION NUMBER: 08/466,343 
; PRIOR FILING DATE: 1995-06-06 
; NUMBER OF SEQ ID NOS : 9 
; SOFTWARE: Patentln version 3.0 
; SEQ ID NO 2 

LENGTH: 352 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-502-783A-2 


Query Match 42.9%; Score 379; DB 9; Length 352; 

Best Local Similarity 35.0%; Pred. No. 3.1e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7 

Qy 2 0 MD YQVS S P I YDI N Y YT S E P CQK I NVKQ I AA 49 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 1 MDYQVSSPIYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

Qy 5 0 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 8 3 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 84 LCTRSQKEGLHYTC 97 

: I II I I I I I I I I I 

Db 119 1 1 LLT I DRYIiAVVHAVFALKARTVT FGWT S VI TWWAVFAS LPGI I FTRS QKEGLH YTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I II I I I I I II I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


Search completed: September 17, 2004, 17:12:54 
Job time : 97.2644 sees 


GenCore version 5.1»6 
Copyright (c) 1993 - 2004 Compugen Ltd. 


OM protein - protein search, using sw model 
Run on: 


Title: 

Perfect score: 
Sequence: 


September 17, 2004, 16:42:06 ; Search time 76.1051 Seconds 

(without alignments) 
650.894 Million cell updates/sec 

US-10-057-890A-31 
884 

1 MKVSVAALSCLMLVTALGSM GLNNCSSSNRLDGHQRVHAA 157 


Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1017041 seqs, 315518202 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 


1017041 


Database 


SPTREMBL 25:* 


1 

2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 


sp_archea : * 
sp_bacteria : * 
sp_f ungi : * 
sp_human : * 
sp_invertebrate : * 
sp_mammal : * 
sp_mhc : * 
sp_organelle : * 
sp_phage : * 

sp_plant : * 

sp_rodent : * 

sp_virus : * 

sp_vertebrate : * 

sp_unclassif ied: * 

sp_rvirus : * 

sp_bacteriap : * 

sp_archeap : * 


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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Match Length DB 


ID 
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1 

373 

42 . 

2 

352 

6 

018772 

018772 

pan troglod 

2 

370 

41. 

9 

344 

6 

Q9TQR8 

Q9tqr8 

cercocebus 

3 

370 

41. 

9 

344 

6 

077833 

077833 

cercocebus 

4 

369 

41. 

7 

352 

6 

Q95NC5 

Q95nc5 

hylobates s 

5 

369 

41. 

7 

352 

6 

Q9TV50 

Q9tv50 

pan troglod 

6 

367 

41. 

5 

352 

6 

Q95NC3 

Q95nc3 

miopithecus 

7 

367 

41. 

5 

352 

6 

018771 

018771 

pan troglod 

8 

366 

41. 

4 

352 

6 

Q9TSK1 

Q9tskl 

cercopithec 

9 

366 

41. 

4 

352 

6 

Q9TV4 9 

Q9tv49 

cercocebus 

10 

366 

41. 

4 

352 

6 

Q9XT13 

Q9xtl3 

papio anubi 

11 

366 

41. 

4 

352 

6 

Q9TV45 

Q9tv45 

cercopithec 

12 

366 

41. 

4 

352 

6 

Q95NE8 

Q95ne8 

cercopithec 

13 

365 

41. 

3 

352 

6 

Q9XT14 

Q9xtl4 

colobus gue 

14 

365 

41. 

3 

352 

6 

Q95NC6 

Q95nc6 

trachypithe 

15 

365 

41. 

3 

352 

6 

Q95NC8 

Q95nc8 

colobus pol 

16 

364 

41. 

2 

352 

6 

Q9XS99 

Q9xs99 

gorilla gor 

17 

363 

41. 

1 

352 

6 

097975 

097975 

macaca arct 

18 

363 

41. 

1 

352 

6 

Q9MZA2 

Q9mza2 

cercopithec 

19 

362 

41. 

0 

352 

6 

Q95NC1 

Q95ncl 

theropithec 

20 

362 

41. 

0 

352 

6 

Q95NC0 

Q95nc0 

hylobates m 

21 

362 

41. 

0 

352 

6 

Q9TV46 

Q9tv46 

cercopithec 

22 

362 

41. 

0 

352 

6 

097962 

097962 

pygathrix a 

23 

361 

40. 

8 

352 

6 

Q95NC7 

Q95nc7 

nasalis lar 

24 

361 

40. 

8 

352 

6 

Q9TV42 

Q9tv42 

cercopithec 

25 

361 

40. 

8 

352 

6 

077776 

077776 

cercocebus 

26 

361 

40. 

8 

352 

6 

Q95NE1 

Q95nel 

cercocebus 

27 

361 

40. 

8 

352 

6 

Q9TQX0 

Q9tqx0 

cercopithec 

28 

359 

40. 

6 

352 

6 

Q9TV43 

Q9tv43 

cercopithec 

29 

358 

40. 

5 

352 

6 

018770 

018770 

pan troglod 

30 

357 

40. 

4 

352 

6 

Q9TV47 

Q9tv47 

cercopithec 

31 

357 

40. 

4 

352 

6 

Q9BGN5 

Q9bgn5 

cercopithec 

32 

357 

40. 

4 

352 

6 

Q9XT12 

Q9xtl2 

cercopithec 

33 

357 

40. 

4 

352 

6 

Q9TSQ7 

Q9tsq7 

cercopithec 

34 

356 

40. 

3 

352 

6 

Q95ND2 

Q95nd2 

mandrillus 

35 

356 

40. 

3 

352 

6 

Q95ND1 

Q95ndl 

mandrillus 

36 

355 

40. 

2 

352 

6 

Q9XT76 

Q9xt76 

cercopithec 

37 

355 

40. 

2 

352 

6 

Q9TV44 

Q9tv44 

cercopithec 

38 

355 

40. 

2 

352 

6 

Q9XS35 

Q9xs35 

macaca neme 

39 

355 

40. 

2 

352 

6 

Q9MZA3 

Q9mza3 

hylobates a 

40 

355 

40. 

2 

352 

6 

Q9TV93 

Q9tv93 

macaca arct 

41 

355 

40. 

2 

352 

6 

Q95ND0 

Q95nd0 

erythrocebu 

42 

346 

39. 

1 

352 

6 

Q9BGN6 

Q9bgn6 

cercopithec 

43 

345 

39. 

0 

339 

4 

Q9UN25 

Q9un25 

homo sapien 

44 

344 

38. 

9 

339 

4 

Q9UN23 

Q9un23 

homo sapien 

45 

344 

38. 

9 

339 

4 

Q9UN27 

Q9un27 

homo sapien 


ALIGNMENTS 


RESULT 1 
018772 

ID 018772 PRELIMINARY; PRT; 352 AA. 

AC 018772; 

DT 01-JAN-1998 (TrEMBLrel. 05, Created) 

DT 01-JAN-1998 (TrEMBLrel. 05, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 


DE CCR5 receptor (Fragment) . 

GN CCR5 . 

OS Pan troglodytes (Chimpanzee) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 

OX NCBI_TaxID=9598; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=ChCCR5-142a; 

RA Zhang L., Carruthers CD., He T . , Huang Y. , Cao Y., Wang G., Hahn B . r 

RA Ho D.D.; 

RT "HIV-1 subtypes, co-receptor usage, and CCR5 polymorphism."; 

RL AIDS Res. Hum. Retroviruses 0:0-0(1997). 

DR EMBL; AF011541; AAB65741.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. 

FT NON_TER 352 352 

SQ SEQUENCE 352 AA; 40598 MW; 39688FA7004C952F CRC64; 

Query Match 42.2%; Score 373; DB 6; Length 352; 

Best Local Similarity 34.6%; Pred. No. 1.7e-33; 

Matches 98; Conservative 8; Mismatches 27; Indels 150; Gaps 7; 

Qy 20 MDYQVS S P I YDI N Y YT SEP CQKINVKQ I AA 49 

I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSS PI YDIDYYTSEPCQKIN VKQIAARLLPPLYSLVFI FGFVDNMLVILI LINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I II I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 1 1 LLT I DRYLAI VHAVFALKARTVT FGWT S VI TWWAVFAS L P GI I FTRS QKEGLH YTC 17 8 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVI LGLVLPLLVMVT CYSGILKTLLRCRNEKKRHRAVRLI F 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYDIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 2 
Q9TQR8 

ID Q9TQR8 PRELIMINARY; PRT; 3 44 AA. 

AC Q9TQR8; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 


DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Chemokine receptor CCR5 . 

GN CCR5 . 

OS Cercocebus torquatus torquatus, and 

OS Cercocebus torquatus atys (Red-crowned mangabey) (Sooty mangabey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Cercocebus. 

OX NCBI_TaxID=81944, 9531; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC SPECIES=C. torquatus torquatus; STRAIN=1049, and 997; 

RA Chen Z., Kwon D., Jin Z., Monard S., Telfer P., Jones M. , Lu C, 

RA Aguilar R. , Ho D.D., Marx P. A.; 

RT "Natural infection of a homozygous delta 24 CCR5 red-capped mangabey 

RT with a R2b-tropic simian immunodeficiency virus."; 

RL J. Exp. Med. 0:0-0(1998). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC SPECIES=C. torquatus atys; 

RA Palacios E., Digilio L., McClure H.M., Chen Z., Marx P. A., 

RA Goldsmith M.A. , Grant R.M. ; 

RT "Parallel evolution of CCR5-null phenotypes in humans and in a natural 

RT host of simian immunodeficiency virus."; 

RL Curr. Biol. 0:0-0(1998). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC SPECIES=C. torquatus torquatus; STRAIN=RCM4 11 ; 

RA Beer B.E., Kuiken C.L., Tooze Z., Foley B.T., Goeken R.M., Brown C.R., 

RA St Claire M. , Hirsch V.M. ; 

RT "Characterization of novel simian immunodeficiency viruses from 

RT redcapped mangabeys from Nigeria ( SIVrcmNgM411 and NgD409) ."; 

RL Submitted (FEB-2001) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; AF094753; AAC62474.1; -. 

DR EMBL; AF079473; AAC31194.1; -. 

DR EMBL; AF084003; AAC62471.1; -. 

DR EMBL; AF349683; AAK69685.1; -. 

DR GO; GO: 0016021; C:integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F : rhodopsin-like receptor activity; IEA. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN__RECEP_F1_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 344 AA; 39592 MW; E15F5F601191A4D1 CRC64; 

Query Match 41.9%; Score 370; DB 6; Length 344; 

Best Local Similarity 34.9%; Pred. No. 3.5e-33; 

Matches 96; Conservative 9; Mismatches 28; Indels 142; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I I I : I I I I I I II I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 


Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I II I I I I I I I I I I I : I : : | 

Db 61 LKSMTDI YLLNLAI SDLLFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFI GFFSGI FF 118 

Qy 84 LCTRSQKEGLHYTCSSHFPYSQ 105 

: I I I I : I I I I I I II I I I | I I I 
Db 119 I ILLTI DRYLAIVHAVFALKARTVTFGVVAVFASLPGI I FTRSQREGLHYTCS SHFPYSQ 178 

Qy 106 YQFWKNFQTLKI HQRVHGG 124 

I I I I I I I II I I I : : I 

Db 179 YQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIFTIMIVYFL 238 

Qy 125 -GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I II I I : I 

Db 239 FWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 273 


RESULT 3 
077833 

ID 077833 PRELIMINARY; PRT; 344 AA. 

AC 077833; 

DT 01-NOV-1998 (TrEMBLrel. 08, Created) 

DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Chemokine receptor CCR5 . 

GN CCR5 . 

OS Cercocebus torquatus torquatus . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Cercocebus. 

OX NCBI_TaxID=81944; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=1208, and 009; 

RA Chen Z., Kwon D., Jin Z., Monard S., Telfer P., Jones M. , Lu C, 

RA Aguilar R. , Ho D.D., Marx P. A.; 

RT "Natural infection of a homozygous delta 24 CCR5 red-capped mangabey 

RT with a R2b-tropic simian immunodeficiency virus."; 

RL J. Exp. Med. 0:0-0(1998). 

DR EMBL; AF094752; AAC62473.1; -. 

DR EMBL; AF084002; AAC62470.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP__Fl_2 ; 1. 

SQ SEQUENCE 344 AA; 39578 MW; 0CEC05E47C2F6DE6 CRC64; 

Query Match 41.9%; Score 370; DB 6; Length 344; 

Best Local Similarity 34.9%; Pred. No. 3.5e-33; 

Matches 96; Conservative 9; Mismatches 28; Indels 142; Gaps 7; 


Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA- 


49 


II II I II I I I I : I I I I I I I I I I I I I I I I I 
Db 1 MD YQVS S PT YD I DYYT S E PCQKINVKQ I AARLL PPLYS LVFVFGFVGN I LWLI L INCKR 60 

Qy 5 0 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I II I I I I : I : : | 

Db 61 LKSMTDI YLLNLAI SDLLFLLTVP FWAH YAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 84 LCTRSQKEGLHYTCSSHFPYSQ 105 

: I I I I : I I 1 I I I I I I I I I I I I 
Db 119 IILLTIDRYLAIVHAVFALKARTVTFGWAVFASLPGIIFTRSQREGLHYTCSSHFPYSQ 178 

Qy 106 YQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I : : | 

Db 179 YQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLI FT IMIVYFL 238 

Qy 125 -GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I :| 

Db 239 FWAP YNIVLLLNTFQEFFGLNNCS S SNRLDQAMQV 273 


RESULT 4 
Q95NC5 

ID Q95NC5 PRELIMINARY; PRT ; 352 AA. 

AC Q95NC5; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5 . 

OS Hylobates syndactylus (Siamang) ( Symphalangia syndactylus) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hylobatidae; Hylobates. 

OX NCBI_TaxID=95 90; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y. ; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny."; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF177884; AAK43367.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40508 MW; F4F64B3AD5AF658A CRC64; 

Query Match 41.7%; Score 369; DB 6; Length 352; 

Best Local Similarity 34.3%; Pred. No. 4.7e-33; 

Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7; 


Qy 


2 0 MDYQVS S PI YDINYYT SE PCQKINVKQ I AA- 
I I I I I I I I I I I : I I I I I I II I I I I I I I I I 


49 


Db 


1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 


Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I II I I I : I : : | 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I II 

Db 119 1 1 L LT I D R YLAI VHAVFAL KART VT F G WT S VI T WWAVFAS LPGIIFTRSQKEGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I :: I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I II I I M I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 5 
Q9TV50 

ID Q9TV50 PRELIMINARY; PRT; 352 AA. 

AC Q9TV50; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5. 

GN CCR5 . 

OS Pan troglodytes (Chimpanzee) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 

OX NCBI_TaxID=9598; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=1410; 

RX MEDLINE=99335215; PubMed=104 087 30 ; 

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.C., 

RA Diop O., Rigoulet J., Barre-Sinoussi F., Fomsgaard A.; 

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates. 11 ; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=1410; 

RA Mueller-Trutwin M.C., Corbet S., Hansen J., Diop O., 

RA Georges-Courbot M.-C, Barre-Sinoussi F. , Fomsgaard A.; 

RL Submitted (NOV-1997) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; AF035214; AAD44007.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_JL ; 1. 

DR PROSITE; PS50262; G PROTEIN RECEP Fl 2 ; 1. 


KW Receptor. 

SQ SEQUENCE 352 AA; 40481 MW; 2578A0F2C07A4A65 CRC64; 


Query Match 41.7%; Score 369; DB 6; Length 352; 

Best Local Similarity 34.3%; Pred. No. 4.7e-33; 

Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I M I I I I I I I I : I I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

QY 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : | 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I I I I I I I I I 

Db 119 1 1 LLS I DRYLAI WAVFALKART VT FGWT S VI TWWAVFAS L PGI I FTRS QKEGLH YTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I II I I I I I : : | 

Db 17 9 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 12 5 GGSYKCGLC QEFFGLNNCS S SNRLDGHQRV 154 

I I I I I II I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 6 
Q95NC3 

ID Q95NC3 PRELIMINARY; PRT; 352 AA. 

AC Q95NC3; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5 . 

OS Miopithecus talapoin (Talapoin) ( Cercopithecus talapoin) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Miopithecus. 

OX NCBI_TaxID=36231; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y. ; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny . 11 ; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF177886; AAK43369.1; 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP Fl 1; 1. 


DR PROSITE; PS502 62; G_PR0TEIN_RECEP_Fl_2 ; 1. 
KW Receptor. 

SQ SEQUENCE 352 AA; 40546 MW; 6464152F3E566AE5 CRC64; 

Query Match 41.5%; Score 367; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. 7.8e-33; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 


QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 


2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

M I I I I I I I I I I I I I II I I I I I I I I I I I 
1 MDYQVSSPTYDINYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 


50 


-YKCGLC- 


AAAQWDFGNTMCQHQRVHGHHHHSYKCG 8 3 

I I I I I II I I I I I I I :: I : : | 

61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC — RLLTGLYFIGFFSGIFF 118 

8 4 LCTRSQKEGLHYTC 97 

: I I I I : I I I I I I I 

119 IILLTIDRYLAIVmVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQREGLHYTC 178 

9 8 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I II I I I I I I II I I : : | 

179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I II I I I I I : I 

239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 7 
018771 
ID 
AC 


DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RC 
RA 
RA 
RT 
RL 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 


018771 PRELIMINARY; PRT; 352 AA. 

018771; 

01-JAN-1998 (TrEMBLrel. 05, 
01-JAN-1998 (TrEMBLrel. 05, 
01-JUN-2003 (TrEMBLrel. 24, 
CCR5 receptor (Fragment) . 
CCR5. 

Pan troglodytes (Chimpanzee) . 
Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
NCBIJTaxID=9598; 
[1] 

SEQUENCE FROM N.A. 
STRAIN=ChCCR5-141a; 
Zhang L., Carruthers 
Ho D.D. ; 

"HIV-1 subtypes, co-receptor usage, and CCR5 polymorphism. " ; 
AIDS Res. Hum. Retroviruses 0:0-0(1997). 
EMBL; AF011539; AAB65739.1; -. 

C: integral to membrane; IEA. 
F: receptor activity; IEA. 
F: rhodopsin-like receptor activity; IEA. 
P:G-protein coupled receptor protein signalin. 
InterPro; IPR000276; GPCR_Rhodpsn . 
Pfam; PF00001; 7tm_l; 1. 
PRINTS; PR00237; GPCRRHODOPSN . 


Created) 

Last sequence update) 
Last annotation update) 


Craniata ; Vertebrata ; Euteleos tomi ; 
Catarrhini; Hominidae; Pan. 


CD., He T., Huang Y., Cao Y., Wang G., Hahn B., 


GO; GO:0016021; 

GO; GO:0004872; 

GO; GO: 0001584; 

GO; GO:0007186; 


IEA. 


DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS502 62; G_PROTEIN_RECEP_Fl__2 ; 1. 

KW Receptor. 

FT NON_TER 352 352 

SQ SEQUENCE 352 AA; 40466 MW; 3FFFAC7ABAE1D4FB CRC64 ; 


Query Match 41.5%; Score 367; DB 6; Length 352; 

Best Local Similarity 34.3%; Pred. No. 7.8e-33; 

Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I II I I I I I I I I : I I I I I I I | | | | | | | | | | 
Dn 1 MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

QY 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHH S YKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISNLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


Qy 84 LCTRSQKEGLHYTC 97 

: Mill I I I I I I 

Db 119 IILLTIDRYIAIVmVFALKARTWFGVAATSVITWWAVFASLPGIIFTRSQKGGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I II I I I I II I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLV1WICYSGILKTLLRCRNEKKRHRAVRLIF 238 

QY 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I II I I I I I I I II I I : I 

Db 239 T IMI VYFLFWAPYNI VLLLNTFQEFFGLNNCS S SNRLDQAMQV 2 81 


RESULT 8 
Q9TSK1 

ID Q9TSK1 PRELIMINARY; PRT; 352 AA. 

AC Q9TSK1; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE G-protein coupled chemokine receptor. 

OS Cercopithecus aethiops (Green monkey) (Grivet) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Cercopithecus . 

OX NCBI_TaxID=9534; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Holtkamp N., Baier M. , Werner A.; 

RL Submitted (AUG-1997) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF019379; AAD01639.1; -. 

DR GO; GO:0016021; Crintegral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO:0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN RECEP_F1 1; 1. 


DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 
KW Receptor. 

SQ SEQUENCE 352 AA; 40588 MW; 0F18 69D9A6668DBB CRC64; 

Query Match 41.4%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. le-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I II I I I I : I I II I I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: II I I : I I I I I I I 

Db 119 IILLTIDRYLAIVmVFALKARTArTFGVWSVITWWAVFASLPRIIFTRSQREGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I II I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I II I I I I II I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 9 
Q9TV4 9 

ID Q9TV49 PRELIMINARY; PRT; 352 AA. 

AC Q9TV4 9; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5 (C-C chemokine receptor 5) . 

GN CCR5. 

OS Cercocebus galeritus (Agile mangabey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Cercocebus. 

OX NCBI_TaxID=9532 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=4; 

RX MEDLINE=99335215; PubMed=104 08730 ; 

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.C., 

RA Diop O., Rigoulet J., Barre-Sinoussi F., Fomsgaard A. ; 

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(199 9). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN-4; 

RA Mueller-Trutwin M.C., Corbet S., Hansen J. , Diop O., 

RA Georges-Courbot M.-C. , Barre-Sinoussi F. , Fomsgaard A. ; 


RL Submitted (NOV-1997) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N . A. 

RA Zhang Y., Ryder O.A., Zhang Y. ; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny. " ; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF035215; AAD44008.1; -. 

DR EMBL; AF177898; AAK43381.1; -. 

DR GO; GO:0016021; Crintegral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS 002 37; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40479 MW; 5E1504A9B87278B2 CRC64; 

Query Match 41.4%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. le-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I II I I I I : I I I I I I I I I I II I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

QY 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG- — 8 3 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDI YLLNLAI S DLLFLLTVPFWAH YAAAQWDFGNTMC — QLLTGL YFI GFFS GI FF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I : I I I I I I I 

Db 119 1 1 LLT I DRYLAI VHAVFALKARTVT FGWT S VI TWWAVFAS LP GI I FT RSQREGLH YT C 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCS S SNRLDGHQRV 154 

I I I I I I II I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


RESULT 10 
Q9XT13 

ID Q9XT13 PRELIMINARY; PRT; 352 AA. 

AC Q9XT13; 

DT 01-NOV-1999 (TrEMBLrel. 12, Created) 

DT 01-NOV-1999 (TrEMBLrel. 12, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor 5. 

GN CCR5 . 

OS Papio anubis (Olive baboon). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 


OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Papio. 

OX NCBI_TaxID=9555; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Spearman P.W., Mburu D.N., Graham B.S.; 

RT "Differential Utilization of CCR5 Molecules from Three East African 

RT Simian Species by the HIV-l Envelope Glycoprotein."; 

RL Submitted (APR-1999) to the EMBL/GenBank/DDB J databases. 

DR EMBL; AF141640; AAD32685.1; 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP__F1_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40489 MW; 0B4 7E337C1 1E2E1E CRC64; 

Query Match 41.4%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. le-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPI YDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 

Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDNYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I : I I I I I I I 

Db 119 1 1 LLT I DR YLAI VHAVFAL KART VT FGWT S VI T WWAVFAS L P G 1 1 FT RS QRE GLH YT C 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

II I 1 I I ! II I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I III I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 11 
Q9TV4 5 

ID Q9TV45 PRELIMINARY; PRT; 352 AA. 

AC Q9TV45; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5. 

GN CCR5. 

OS Cercopithecus nictitans (white-nosed guenon) . 


DR 
DR 
DR 


OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Cercopithecus . 

OX NCBI_TaxID=36228; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=7; 

RX MEDLINE=99335215; PubMed=104 0873 0; 

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.C., 

RA Diop 0., Rigoulet J., Barre-Sinoussi F., Fomsgaard A. ; 

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=7; 

RA Mueller-Trutwin M.C., Corbet S., Hansen J., Diop 0., 

RA Georges-Courbot M.-C, Barre-Sinoussi F., Fomsgaard A. ; 

RL Submitted (NOV-1997) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF035219; AAD44012.1; 

DR GO; GO: 0016021; Crintegral to membrane; IEA. 
GO; GO: 0004872; F: receptor activity; IEA. 

GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR Inter Pro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40658 MW; BC665891ED5661F8 CRC64; 

Query Match 41.4%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. le-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I 1 I I I I I I I I I : I I I I I I I I I II I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPLLYSLVFIFGFVGNILWLILINCKR 60 

QY 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I II II I I I I I II I : I : : I 

Db 61 LKSMTD I YLLNLAI S DLLFLLTVP FWAH YAAAQWDFGNTMC — QLLT GL YFI GFFS GI FF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I : I I I I I II 

Db 119 IILLTIDRYLAIVIlAVFALKARTWFGVvTSVITWVVAVFASLPRIIFTRSQREGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I M I I I I I I I I I | | | ] | | : : | 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVT4VICYSGILKTLLRCRNEKKRHRAVMLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I II I I I I II : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


DR 
DR 


RESULT 12 
Q95NE8 

ID Q95NE8 PRELIMINARY; PRT ; 352 AA. 

AC Q95NE8; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor type 5. 

GN CCR5. 

OS Cercopithecus tantalus. 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Cercopithecus. 

OX NCBI_TaxID=60712; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99335215; PubMed=l 04 08730 ; 

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.C., 

RA Diop O., Rigoulet J., Barre-Sinoussi F. , Fomsgaard A.; 

RT "Mutations in CCR5-coding sequences are not associated with SIV 

RT carrier status in African nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:931-939(1999). 

DR EMBL; AF081577; AAD45495.1; -. 

DR GO; GO: 0016021; C:integral to membrane; IEA. 

GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF000 01; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

SQ SEQUENCE 352 AA; 40576 MW; 0F18 69D9B7968DBB CRC64; 

Query Match 41.4%; Score 366; DB 6; Length 352; 

Best Local Similarity 33.9%; Pred. No. le-32; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I I : II I I I II I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

QY 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 8 3 

I I I I II I I I I I I I I I : I : : | 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I : I I I I I I I 

Db 119 IILLTIDRYLAIVliAVFALKARTvTFGVVTSVITWVVAVFASLPRIIFTRSQREGLHYTC 178 

QY 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I II I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAP YNIVLLLNTFQEFFGLNNCS S SNRLDQAMQV 281 


RESULT 13 
Q9XT14 

ID Q9XT14 PRELIMINARY; PRT; 352 AA. 

AC Q9XT14; 

DT 01-NOV-1999 (TrEMBLrel. 12, Created) 

DT 01-NOV-1999 (TrEMBLrel. 12, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE CC chemokine receptor 5. 

GN CCR5 . 

OS Colobus guereza (Black-and-white colobus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; Colobinae; 

OC Colobus. 

OX NCBI_TaxID=3354 8; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Spearman P.W., Mburu D.N., Graham B.S.; 

RT "Differential Utilization of CCR5 Molecules from Three East African 

RT Simian Species by the HIV-1 Envelope Glycoprotein."; 

RL Submitted (APR-1999) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AF141639; AAD32684.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR Inter Pro; IPR00027 6; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PROTEIN_RECEP__Fl_l ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW Receptor. ~~ 

SQ SEQUENCE 352 AA; 40550 MW; 9B07 8EF04D34DB36 CRC64; 

Query Match 41.3%; Score 365; DB 6; Length 352; 

Best Local Similarity 33.6%; Pred. No. 1.3e-32; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I M I I I I I I I : I I I I I I I M : I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 8 3 

I I I I I I I I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: 1111:1111111 

Db H9 IILLTIDRYLAIWAVFALKARTATFGVVTSVITWWAVFASLPGIIFTRSQREGLHYTC 178 

Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I II I I I : : | 

Db 119 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I II I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


DR 
DR 


RESULT 14 
Q95NC6 

ID Q95NC6 PRELIMINARY; PRT; 352 AA. 

AC Q95NC6; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5 . 

OS Trachypithecus johnii (hooded leaf monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; Colobinae; 

OC Trachypithecus. 

OX NCBI_TaxID=66063 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Y. , Ryder O.A. , Zhang Y. ; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny. "; 

RL Submitted (AUG-1999) to the EMBL/GenBank/ DDBJ databases. 

DR EMBL; AF177883; AAK43366.1; 

DR GO; GO: 0016021; Crintegral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

GO; GO: 0001584; F: rhodopsin-like receptor activity; IEA. 

GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40462 MW; 52824E0322559F7F CRC64; 

Query Match 41.3%; Score 365; DB 6; Length 352; 

Best Local Similarity 33.6%; Pred. No. 1.3e-32; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

I I I I I I I I I I I : I I I I I I II I : I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

QY 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I II I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I : I I I I II I 

Db 119 1 1 LLT I DRYLAI VHAVFAIjKARTVT FGVVT S VI TWWAVFAS LP GI I FT RS QREGLH YT C 178 

QY 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I II I I I I I II I I I I I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRSEKKRHRAVRLIF 238 


Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I II I I I I I I I I I I I I I : I 


23 9 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 


RESULT 15 
Q95NC8 

ID Q95NC8 PRELIMINARY; PRT; 352 AA. 

AC Q95NC8; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE C-C chemokine receptor 5. 

GN CCR5 . 

OS Colobus polykomos. 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; Colobinae; 

OC Colobus. 

OX NCBI_TaxID=9572; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A. , Zhang Y. ; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny. "; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AF177881; AAK43364.1; -. 

DR GO; GO: 0016021; C: integral to membrane; IEA. 

DR GO; GO: 0004872; F: receptor activity; IEA. 

DR GO; GO: 0001584; F : rhodopsin-like receptor activity; IEA. 

DR GO; GO: 0007186; P:G-protein coupled receptor protein signalin. . .; IEA. 

DR Inter Pro; IPR000276; GPCR__Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW Receptor. 

SQ SEQUENCE 352 AA; 40578 MW; 4366F149C3B4938F CRC64; 

Query Match 41.3%; Score 365; DB 6; Length 352; 

Best Local Similarity 33.6%; Pred. No. 1.3e-32; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 20 MD YQVS S P I YD IN Y YT SEP CQKINVKQ I AA 49 

I I I I I I I I I II : I I I I II I I I : I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I II I I II I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGI FF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I II : I I I I I I I 

Db 119 IILLTIDRYIA.IV11AVFALKARTWFGVVTSVITWVVAVFASLPGIIFTRSQREGLHYTC 178 

QY 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG- 124 

I I I I I II I I I I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 


I I I I I I I I I I I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 
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31 
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X 
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1 

X 

7 1 Q 7 UTTMaM 
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X 
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1 

X 
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1 
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41 

136.5 

15. 

4 

825 

1 

ZF28_MOUSE 

P10078 

mus musculu 

42 

135.5 

15. 

3 

643 

1 

ZN74 HUMAN 

Q16587 

homo sapien 

43 

133.5 

15. 

1 

636 

1 

ZF90_MOUSE 

Q61967 

mus musculu 

44 

132.5 

15. 

0 

642 

1 

ZN14_HUMAN 

P17017 

homo sapien 

45 

131.5 

14. 

9 

367 

1 

Z211_HUMAN 

Q13398 

homo sapien 


ALIGNMENTS 


RESULT 1 
CKR5_HUMAN 

ID CKR5_HUMAN STANDARD; PRT; 352 AA. 

AC P51681; 014692; 014693; 014695; 014696; 014697; 014698; 014699; 

AC 014700; 014701; 014702; 014703; 014704; 014705; 014706; 014707; 

AC 014708; 015538; Q9UPA4; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) 

DE (HIV-1 fusion co-receptor) (CHEMR13) (CD195 antigen) . 

GN CCR5 OR CMKBR5. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96241590; PubMed=8 6394 8 5 ; 

RA Samson M. , Labbe 0., Mollereau C, Vassart G. , Parmentier M. ; 

RT "Molecular cloning and functional expression of a new human 

RT CC-chemokine receptor gene."; 

RL Biochemistry 35:3362-3367(1996). 

RN [2] 


RP SEQUENCE FROM N.A. 

RX MEDLINE=96291862; PubMed=8663314 ; 

RA Raport C.J., Gosling J., Schweichart V.L., Gray P.W., Charo I.F.; 

RT "Molecular cloning and functional characterization of a novel human 

RT CC chemokine receptor (CCR5) for RANTES , MlP-lbeta, and MlP-lalpha . " ; 

RL J. Biol. Chem. 271:17161-17166(1996). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=962 95970; PubMed=8699119 ; 

RA Combadiere C. , Ahuja S.K., Tiffany H.L., Murphy P.M.; 

RT "Cloning and functional expression of CC CKR5, a human monocyte CC 

RT chemokine receptor selective for MIP-1 (alpha) , MlP-l(beta), and 

RT RANTES 

RL J. Leukoc. Biol. 60:147-152(1996). 

RN [ 4 ] 

RP SEQUENCE FROM N.A. 

RA McCombie W.R., Wilson R. , Chen E., Gibbs R. , Zuo L., Johnson D., 

RA Nhan M., Parnell L., Dedhia N., Ansari A., Mardis E., Schutz K. , 

RA Gnoj L., la Bastide M. , Kaplan N., Greco T., Touchman J., 

RA Muzny D., Chen C.N., Evans C. , Fitzgerald M., See L.H., Tang M. , 

RA Porcel B.M., Dragan Y. , Giacalone J. , Pae A., Powell E., 

RA Solinsky K.A. , Desilva U., Diaz-Perez S., Zhou X., Yu Y., 

RA Watanabe M. , Doggett N., Garcia D. , Sagripanti J.L.; 

RL Submitted (MAY-1997) to the EMBL/ GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98001387; PubMed=9343222 ; 

RA Kuhmann S.E., Piatt E.J., Kozak S.L., Kabat D. ; 

RT "Polymorphisms in the CCR5 genes of African green monkeys and mice 

RT implicate specific amino acids in infections by simian and human 

RT immunodeficiency viruses."; 

RL J. Virol. 71:8642-8656(1997). 

RN [6] 

RP SEQUENCE FROM N.A. , AND POLYMORPHISMS. 

RX MEDLINE=98022612; PubMed=9359654; 

RA Zhang L., Carruthers CD., He T., Huang Y., Cao Y., Wang G., Hahn B., 

RA Ho D.D. ; 

RT "HIV type 1 subtypes, coreceptor usage, and CCR5 polymorphism."; 

RL AIDS Res. Hum. Retroviruses 13:1357-1366(1997). 

RN [7] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98049523; PubMed=9388201; 

RA Mummidi S., Ahuja S.S., McDaniel B.L., Ahuja S.K.; 

RT "The human CC chemokine receptor 5 (CCR5) gene. Multiple transcripts 

RT with 5 '-end heterogeneity, dual promoter usage, and evidence for 

RT polymorphisms within the regulatory regions and noncoding exons."; 

RL J. Biol. Chem. 272:30662-30671(1997). 

RN [8] 

RP SEQUENCE FROM N.A., AND VARIANT ARG-178. 

RA Magierowska M. , Barre-Sinoussi F. , Issafras H., Theodorou I., 

RA Debre P. ; 

RL Submitted (MAR-1998) to the EMBL/ GenBank/ DDB J databases. 

RN [9] 

RP SEQUENCE FROM N.A. 

RA Kopatz S.A., Aronstam R.S., Sharma S.V.; 

RT "cDNA clones of human proteins involved in signal transduction 

RT sequenced by the Guthrie cDNA resource center (www.cdna.org)."; 


RL Submitted (JAN-2003) to the EMBL/GenBank/DDBJ databases. 

RN [10] 

RP CHARACTERIZATION OF ITS HIV-1 RECEPTOR FUNCTION. 

RX MEDLINE=96260017; PubMed=8 64 9511 ; 

RA Deng H., Liu R. , Ellmeier W., Choe S., Unutmaz D., Burkhart M. , 

RA di Marzio P., Marmon S., Sutton R.E., Hill CM., Davis C.B., 

RA Peiper S.C., Schall T.J., Littman D.R., Landau N.R.; 

RT "Identification of a major co-receptor for primary isolates of 

RT HIV-1."; 

RL Nature 381:661-666(1996). 

RN [11] 

RP CHARACTERIZATION OF ITS HIV-1 RECEPTOR FUNCTION. 

RX MEDLINE=96260018; PubMed=8 649512 ; 

RA Dragic T., Litwin V. , Allaway G.P., Martin S.R., Huang Y. , 

RA Nagashima K.A. , Cayanan C, Maddon P. J., Koup R.A. , Moore J. P., 

RA Paxton W.A. ; 

RT "HIV-1 entry into CD4+ cells is mediated by the chemokine receptor 

RT CC-CKR-5."; 

RL Nature 381:667-673(1996). 

RN [12] 

RP SULFATION. 

RX MEDLINE=99189752; PubMed=l 0 08 98 82 ; 

RA Farzan M. , Mirzabekov T., Kolchinsky P., Wyatt R . r Cayabyab M. , 

RA Gerard N.P., Gerard C, Sodroski J., Choe H. ; 

RT "Tyrosine sulfation of the amino terminus of CCR5 facilitates HIV-1 

RT entry."; 

RL Cell 96:667-676(1999). 

CC FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

cc MIP-1-beta and rantes and subsequently transduces a signal by 

cc increasing the intracellular calcium ions level. May play a role 

cc in the control of granulocytic lineage proliferation or 

cc differentiation. Acts as co-receptor with CD4 for primary non- 

CC syncytium-inducing strains (NSI) (macrophage-tropic) of HIV-1 

CC virus. It promotes Env-mediated fusion of the virus. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- TISSUE SPECIFICITY: Found in promyelocytic cells. 

CC -!- PTM: Sulfation contributes to the efficiency of HIV-1 entry. 

CC -!- PTM: Modified by O-linked glycosylation, but not by N-linked 

CC glycosylation. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; X91492; CAA62796.1; -. 

DR EMBL; U54994; AAC50598.1; -. 

DR EMBL; U57840; AAB17071.1; 

DR EMBL; U95626; AAB57793.1; -. 

DR EMBL; U83326; AAC51797.1; -. 

DR EMBL; AF011500; AAB65700.1; -. 

DR EMBL; AF011501; AAB65701.1; -. 

DR EMBL; AF011502; AAB65702.1; -. 
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DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 


EMBL; AF011503; AAB65703.1 
EMBL; AF011505; AAB65705.1 
EMBL; AF011506; AAB65706.1 
EMBL; AF011507; AAB65707.1 
EMBL; AF011508; AAB65708.1 
EMBL; AF011509; AAB65709.1 
EMBL; AF011510; AAB65710.1 
EMBL; AF011511; AAB65711.1 
EMBL; AF011512; AAB65712.1 
EMBL; AF011513; AAB65713.1 
EMBL; AF011514; AAB65714.1 
EMBL; AF011515; AAB65715.1 
EMBL; AF011516; AAB65716.1 
EMBL; AF011517; AAB65717.1 
EMBL; AF011518; AAB65718.1 
EMBL; AF011519; AAB65719.1 
EMBL; AF011520; AAB65720.1 
EMBL; AF011521; AAB65721.1 
EMBL; AF011522; AAB65722.1 
EMBL; AF011523; AAB65723.1 
EMBL; AF011524; AAB65724.1 
EMBL; AF011525; AAB65725.1 
EMBL; AF011526; AAB65726.1 
EMBL; AF011527; AAB65727.1 
EMBL; AF011528; AAB65728.1 
EMBL; AF011529; AAB65729.1 
EMBL; AF011530; AAB65730.1 
EMBL; AF011531; AAB65731.1 
EMBL; AF011532; AAB65732.1 
EMBL; AF011533; AAB65733.1 
EMBL; AF011534; AAB65734.1 
EMBL; AF011535; AAB65735.1 
EMBL; AF011536; AAB65736.1 
EMBL; AF011537; AAB65737.1 
EMBL; AF031237; AAB94735.1 
EMBL; AF052539; AAD18131.1 
EMBL; AY221093; AA065971.1 
Genew; HGNC:1606; CCR5 . 
MIM; 601373; -. 
GO 
GO 
GO 
GO 
GO 
GO 
GO 
GO 
GO 
GO 
GO 
GO 


GO:0005768; C:endosome; TAS . 

GO:0005887; Crintegral to plasma membrane; TAS. 
GO: 0016493; F:C-C chemokine receptor activity; NAS . 
F:coreceptor activity; TAS. 
P: cell-cell signaling; TAS. 
P: cellular defense response; TAS. 
P : chemotaxis ; TAS . 

Prcytosolic calcium ion concentration elevation; TAS . 
P:G-protein coupled receptor protein signalin. . .; TAS, 
P: inflammatory response; TAS. 
P: invasive growth; TAS. 
GO: 0007203; P : phosphatidylinositol-4 , 5-bisphosphate hydro. . .; TAS. 
InterPro; IPR000276; GPCR^Rhodpsn . 
Pfam; PF00001; 7tm_l; 1. 
PRINTS; PR00237; GPCRRHODOPSN . 
PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 1. 
PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation; 


GO:0015026; 
GO:0007267; 
GO: 0006968; 
GO: 0006935; 
GO:0007204; 
GO: 0007186; 
GO:0006954; 
GO: 0007125; 
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2 (POTENTIAL). 
EXTRACELLULAR (POTENTIAL) . 

3 (POTENTIAL). 


Query Match 42.9%; Score 379; DB 1; Length 352; 

Best Local Similarity 35.0%; Pred. No. 2.2e-29; 

Matches 99; Conservative 7; Mismatches 27; Indels 150; Gaps 7; 
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-YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I II I I I II I : I : : I 


-LCTRSQKEGLHYTC 97 
: I I I I I I I I I I II 


■ Mi* r 1 DQYQr WKN FQTLKI HQRVHGG 124 

I I I I I M I I I I I II I I I I I : : I 


-GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 


RESULT 2 
CKR5_PANTR 

ID CKR5_PANTR STANDARD; PRT; 352 AA. 

AC P56440; O02778; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (c-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Pan troglodytes (Chimpanzee) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos torni ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 
OX ' NCBIJTaxID=9598; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97268687; PubMed=910 8 095 ; 

RA Edinger A.L., Arnedee A., Miller K. , Doranz B.J., Endres M., 

RA Sharron M. , Samson M., Lu Z.H., Clements J.E., Murphey-Corb M. , 

RA Peiper S.C., Parmentier M. , Broder C.C., Doms R.W.; 

RT "Differential utilization of CCR5 by macrophage and T cell tropic 

RT simian immunodeficiency virus strains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 94:4005-4010(1997). 

RN [2] 

RP SEQUENCE FROM N.A. 


RA Zimmerman P. A., Buckler-White A., Alkhatib G. ; 

RL Submitted (MAR-1997) to the EMBL/ GenBank/ DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98022612; PubMed=9359654 ; 

RA Zhang L., Carruthers CD., He T., Huang Y., Cao Y. , Wang G. , Hahn B . , 

RA Ho D.D. ; 

RT "HIV type 1 subtypes, coreceptor usage, and CCR5 polymorphism."; 

RL AIDS Res. Hum. Retroviruses 13:1357-1366(1997). 
RN [4] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97426118; PubMed=9282822; 

RA Zacharova V., Zachar V., Goustin A.S.; 

RT "Sequence of chemokine receptor gene CCR5 in chimpanzees, a natural 

RT HIV type 1 host."; 

RL AIDS Res. Hum. Retroviruses 13:1159-1161(1997). 
RN [5] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98090115; PubMed=9430250; 

RA Pretet J.-L., Zerbib A., Girard M. , Guillet J.-G., Butor C; 

RT "Chimpanzee CXCR4 and CCR5 act as coreceptors for HIV type 1 . " ; 

RL AIDS Res. Hum. Retroviruses 13:1583-1587(1997). 
RN [6] 

RP SEQUENCE FROM N.A. 

RA Zhang Y., Ryder O.A., Zhang Y. ; 

RT "Sequence comparison of the CCR5 gene in primates and primate 

RT phylogeny."; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/ DDB J databases. 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF005663; AAB62557.1; -. 

DR EMBL; U94329; AAB58446.1; 

DR EMBL; AF011542; AAB65742.1; -. 

DR EMBL; U97666; AAC51670.1; 

DR EMBL; AF011540; AAB65740.1; -. 

DR EMBL; U89797; AAC03717.1; -. 

DR EMBL; AF177894; AAK43377.1; 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEINJRECEP_F1_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 
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SEQUENCE 

352 AA; 

40539 

MW; 4A33E698B80FE34C CRC64 ; 


Query Match 42.3%; Score 374; DB 1; Length 352; 

Best Local Similarity 34.6%; Pred. No. 6.6e-29; 

Matches 98; Conservative 8; Mismatches 27; Indels 150; Gaps 7; 

2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I II I I I I II I : I I I I I I I I I I I II I I | | 
1 MDYQVSSPIYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : | 

61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

84 LCTRSQKEGLHYTC 97 

: I I I II II II I I I 

119 1 1 LLT I DRYLAI VHAVFALKARTVT FGVVT S VI TWWAVFAS LPGI I FTRS QKEGLH YTC 178 

98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 12 4 

M I I I I I I I I II I I I I I I I I : : | 

179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I II I I I I I I I I I I II II : I 

239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 
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RESULT 3 
CKR5_HYLSY 

ID CKR5_HYLSY STANDARD; PRT; 352 AA. 

AC Q95NC5; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 
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OC 
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RX 
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RT 
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CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
DR 
DR 
DR 
DR 


CCR5 OR CMKBR5. 

Hylobates syndactylies (Siamang) ( Symphalangia syndactylus ) . 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hylobatidae; Hylobates. 
NCBI_TaxID=9590; 
[1] 

SEQUENCE FROM N.A. 

MEDLINE=99416438; PubMed=l 04 8 697 0 ; 
Zhang Y.-W., Ryder O.A. , Zhang Y.-P.; 

"Sequence evolution of the CCR5 chemokine receptor gene in primates."; 
Mol. Biol. Evol. 16:1145-1154(1999). 

-!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
MIP-1-beta and RANTES and subsequently transduces a signal by 
increasing the intracellular calcium ions level. May play a role 
in the control of granulocytic lineage proliferation or 
differentiation . 

SUBCELLULAR LOCATION: Integral membrane protein. 

SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http : //www. isb-sib . ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; AF177884; AAK43367.1; -. 
InterPro; IPR000276; GPCR_Rhodpsn . 
Pfam; PF00001; 7tm_l;' 1. 
PRINTS; PR00237; GPCRRHODOPSN . 
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SEQUENCE 

352 AA; 

40508 MW 

; F4F64B3AD5AF658A CRC64; 


Query Match 41.7%; Score 369; DB 1; Length 352; 

Best Local Similarity 34.3%; Pred. No. 2e-28; 


Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7; 
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I I I I I I I I I I I I I I I I I I : I 


RESULT 4 
CKR5_PONPY 

ID CKR5_PONPY STANDARD; PRT; 352 AA. 

AC 097881; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Pongo pygmaeus (Orangutan) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pongo. 

OX NCBI_TaxID=9600; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=104 86970 ; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

cc in the control of granulocytic lineage proliferation or 

CC differentiation . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC . 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 
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Sulfation. 


Query Match 41.7%; Score 369; DB 1; Length 352; 

Best Local Similarity 34.3%; Pred. No. 2e-28; 

Matches 97; Conservative 8; Mismatches 28; Indels 150; Gaps 7; 
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RESULT 5 
CKR5 PAPHA 


ID CKR5_PAPHA STANDARD; PRT; 352 AA. 

AC P56441; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5). 
GN CCR5 OR CMKBR5. 

OS Papio hamadryas (Hamadryas baboon), and 
OS Papio anubis (Olive baboon) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 
OC Cercopithecinae; Papio. 

OX NCBI_TaxID=9557, 9555; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC SPECIES=P . hamadryas ; 

RX MEDLINE=97268687; PubMed=9108 095 ; 

RA Edinger A.L., Amedee A., Miller K. , Doranz B.J., Endres M. , 

RA Sharron M. , Samson M. , Lu Z.H., Clements J.E., Murphey-Corb M. , 

RA Peiper S.C., Parmentier M. , Broder C.C., Doms R.W. ; 

RT "Differential utilization of CCR5 by macrophage and T cell tropic 

RT simian immunodeficiency virus strains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 94:4005-4010(1997). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC SPECIES=P . hamadryas ; 

RX MEDLINE=99210133; PubMed=10195758 ; 

RA Saksena N.K., Wang B., Novembre F.J., Bolton W. , Smit T.K., Lai R.B.; 

RT "Species-specific changes in the CCR5 gene from African and Asian 

RT nonhuman primates."; 

RL AIDS Res. Hum. Retroviruses 15:479-4 83(1999). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC SPECIES=P . anubis ; 

RA Benton P. A., Timanus D.K., Shearer M.H., Lee D.R., Kennedy R.C.; 

RL Submitted (NOV-1997) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

cc in the control of granulocytic lineage proliferation or 

CC differentiation . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF005658; AAB62552.1; -. 

DR EMBL; AF105287; AAD20556.1; -. 

DR EMBL; AF105288; AAD20557.1; -. 

DR EMBL; AF105289; AAD20558.1; -. 

DR EMBL; AF105290; AAD20559.1; -. 
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CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

69 

89 

2 (POTENTIAL) . 

FT 

DOMAIN 

90 

102 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

103 

124 

3 (POTENTIAL) . 

FT 

DOMAIN 

125 

141 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

142 

166 

4 (POTENTIAL) . 

FT 

DOMAIN 

167 

198 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

199 

218 

5 (POTENTIAL) . 

FT 

DOMAIN 

219 

235 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

236 

260 

6 (POTENTIAL). 

FT 

DOMAIN 

261 

277 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

278 

301 

7 (POTENTIAL) . 

FT 

DOMAIN 

302 

352 

CYTOPLASMIC (POTENTIAL) . 

FT 

DISULFID 

101 

178 

BY SIMILARITY. 

FT 

MOD_RES 

3 

3 

SULFATION (BY SIMILARITY) . 

FT 

MOD RES 

10 

10 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

14 

14 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

15 

15 

SULFATION (BY SIMILARITY) . 

FT 

CARBOHYD 

268 

268 

N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ 

SEQUENCE 

352 AA; 

40489 MW 

; 5E1504A9BA1FE8B2 CRC64; 


Query Match 41.4%; Score 366; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 3.9e-28; 

Matches 96; Conservative 9; Mismatches 28; Indels 150; 


Gaps 


7; 


QY 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 


20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 

I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 
1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 

50 


49 


60 


83 


YKCGLC AAAQWDFGNTMCQHQRVHGHHHHS YKCG 

I I I I I I II I I I II I I : I : : I 

61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 


84 


119 


LCTRSQKEGLHYTC 97 
: I I I I : I I I I I I I 

IILLTIDRYLAIVTIAVFALKARTWFGWrSVITWWAVFASLPGIIFTRSQREGLHYTC 17 8 


-HQRVHGG- 


124 


98 SSHFPYSQYQFWKNFQTLKI 

I M M I I I I I I I I I I I I I I I 
179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVKVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


125 


239 


GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 

I I I I I I I I I I I I I I I I I I : I 

TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 


154 


281 


RESULT 6 
CKR5 PYGBI 


ID CKR5_PYGBI STANDARD; PRT; 352 AA. 

AC 097880; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) 

GN CCR5 OR CMKBR5. 

OS Pygathrix bieti (Black snub-nosed monkey) ( Rhinopithecus bieti) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; Colobinae; 

OC Pygathrix. 

OX NCBI__TaxID=61621; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed-104 8 697 0 ; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

cc in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 


CC 
CC 


This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF075445; AAD19857.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1_2 ; 1. 

KW G-Drotein COUDled rP.r.Pnhnr i Trancmpmhrano' Cl u^^v^f „ . e„if„4.^- 


FT 

DOMAIN 

1 

30 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

31 

58 

1 (POTENTIAL) . 

FT 

DOMAIN 

59 

68 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

69 

89 

2 (POTENTIAL). 

FT 

DOMAIN 

90 

102 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

103 

124 

3 (POTENTIAL) . 

FT 

DOMAIN 

125 

141 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

142 

166 

4 (POTENTIAL) . 

FT 

DOMAIN 

167 

198 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

199 

218 

5 (POTENTIAL) . 

FT 

DOMAIN 

219 

235 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

236 

260 

6 (POTENTIAL) . 

FT 

DOMAIN 

261 

277 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

278 

301 

7 (POTENTIAL) . 

FT 

DOMAIN 

302 

352 

CYTOPLASMIC (POTENTIAL) . 

FT 

DISULFID 

101 

178 

BY SIMILARITY. 


FT MOD_RES 

3 

3 


SULFATION 

(BY SIMILARITY) . 

r 1 MUD KHjD 

i r\ 

10 

10 


SULFATION 

(BY SIMILARITY) . 

r 1 MUD KH.O 

1 A 

14 

14 


SULFATION 

(BY SIMILARITY) . 

FT MOD~RES 

15 

15 


SULFATION 

(BY SIMILARITY) . 

SQ SEQUENCE 

352 AA; 

40585 MW; 

4366F142730F938F CRC64; 

Query Match 


41. 

3%; 

Score 365; 

DB 1; Length 352; 

Best Local 

Similarity 

33. 

6%; 

Pred. No. 

4. 9e-28; 


Matches 


Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 


95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I I : I I I II I I I I : I M I I I I 
1 MDYQVSSPTYDIDYYTSEPCQKVNVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 


50 


61 


84 


83 


YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 

I I I I I I I I I I I I I I I : I : : I 

LKSMTDI YLLNLAI SDLFFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFI GFFSGI FF 118 


LCTRSQKEGLHYTC 97 
: I I I I : I I I I I I I 

119 IILLTIDRY LAI VHAVF AL KART VT F GWT S VI T WWAVF AS LPGIIFTRSQREGLHYTC 178 


98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 

I I I M II I I I I I I I I I I I II : : I 

179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 

125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 

239 TIMI VYFLFWAPYNI VLLLNT FQEFFGLNNCS S SNRLDQAMQV 281 


124 


238 


RESULT 7 
CKR5_PYGNE 

ID CKR5_PYGNE STANDARD; PRT; 352 AA. 

AC 097882; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Pygathrix nemaeus (Dove langur) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; Colobinae; 

OC Pygathrix. 

OX NCBI_TaxID=54133; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

cc increasing the intracellular calcium ions level. May play a role 

cc in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 



-!- SIMILARITY: Belongs to 

family 1 of G-protein coupled receptors. 


This SWISS-PROT entry is copyright. It is 

produced through a collaboration 

cc 

between 

the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

cc 

the European Bioinf ormatics 

Institute. There are no restrictions on its 

cc 

use by 

non-profit institutions as long 

as its content is in no way 

cc 

modified 

and this 

statement 

is not removed 

Usage by and for commercial 

cc 

entities 

requires 

a license 

agreement (See 

http : //www. isb-sib . ch/announce/ 

cc 
cc 

or send an email to license@isb-sib . ch) . 


DR 

EMBL; AF075448; AAD19860.1; 



DR 

InterPro; 

IPR000276; GPCR Rhodpsn. 


DR 

Pfam; PF00001; 7tm 1; 1. 



DR 

PRINTS; PR00237; GPCRRHODOPSN . 


DR 

PROSITE; 

PS00237; 

G_PROTEIN 

_RECEP_F1 1; 1. 


DR 

PROSITE; 

PS50262; 

G_PROTEIN 

_RECEP_F1 2; 1. 


KW 

G-protein 

coupled 

receptor; 

Transmembrane ; 

Glycoprotein; Sulfation . 

FT 

DOMAIN 

1 

30 

EXTRACELLULAR 

(POTENTIAL) . 

FT 

TRANSMEM 

31 

58 

1 (POTENTIAL) 


FT 

DOMAIN 

59 

68 

CYTOPLASMIC (] 

POTENTIAL) . 

FT 

TRANSMEM 

69 

89 

2 (POTENTIAL) 


FT 

DOMAIN 

90 

102 

EXTRACELLULAR 

(POTENTIAL) . 

FT 

TRANSMEM 

103 

124 

3 (POTENTIAL) 

FT 

DOMAIN 

125 

141 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

142 

166 

4 (POTENTIAL), 


FT 

DOMAIN 

167 

198 

EXTRACELLULAR 

(POTENTIAL) . 

FT 

TRANSMEM 

199 

218 

5 (POTENTIAL) . 


FT 

DOMAIN 

219 

235 

CYTOPLASMIC (POTENTIAL) . 

FT 

TRANSMEM 

236 

260 

6 (POTENTIAL), 


FT 

DOMAIN 

261 

277 

EXTRACELLULAR 

(POTENTIAL) . 

FT 

TRANSMEM 

278 

301 

7 (POTENTIAL) . 


FT 

DOMAIN 

302 

352 

CYTOPLASMIC (POTENTIAL) . 

r J. 

DISULFID 

101 

178 

BY SIMILARITY. 


FT 

MOD_RES 

3 

3 

SULFATION (BY 

SIMILARITY) . 

FT 

MOD_RES 

10 

10 

SULFATION (BY 

SIMILARITY) . 

FT 

MOD RES 

14 

14 

SULFATION (BY 

SIMILARITY) . 

FT 

MOD__RES 

15 

15 

SULFATION (BY 

SIMILARITY) . 

SQ 

SEQUENCE 

352 AA; 

40532 MW; FE4F9D98D3B3E861 CRC64; 


Query Match 41.3%; Score 365; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 4.9e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

Qy 2 0 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 4 9 

llllllll I I I : I I II I I I I I : I I I I I I I 
Db 1 MDYQVS S PT YDI DYYTSEPCQKVNVKQI AARLLP PLYSLVFI FGFVGNI LWLI LINCKR 60 

50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHS YKCG 83 

I I I I I I I I I II I I I I : I : : I 

Db 61 LKSMTDI YLLNLAI S DLFFLLTVP FWAH YAAAQWDFGNTMC — QLLTGLYFIGFFSGI FF 118 

Qy 84 LCTRSQKEGLHYTC 97 

: I I I I : II I I I I I 

Db H9 IILLTIDRYLAIVmVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQREGLHYTC 178 

QY 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I II I I I I I I I I : : | 


179 SSHFPYSQYQFWKNFQTLKIVILGLVLPLLIMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I II I II I I I I I I I I : | 

Db 239 TIMIVTFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 

RESULT 8 
CKR5_TRAFR 

ID CKR5_TRAFR STANDARD; PRT; 352 AA. 

AC 097878; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) 

GN CCR5 OR CMKBR5 . 

OS Trachypithecus francoisi (Francois 1 langur) (Indochinese langur) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; Colobinae; 

OC Trachypithecus . 

OX NCBI_TaxID=54 1 8 0 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=104 8697 0; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

cc increasing the intracellular calcium ions level. May play a role 

cc in tne control of granulocytic lineage proliferation or 

CC differentiation. 

SUBCELLULAR LOCATION: Integral membrane protein. 

SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 


CC - !- 
CC - ! - 


CC 
CC 


on 


This SWISS-PROT entry is copyright. It is produced through a collaborati 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF075442; AAD19854.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS50262; G_PR0TEIN_RECEP_F1~2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 

FT DOMAIN 1 3 0 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 31 58 1 (POTENTIAL) . 

FT DOMAIN 59 68 CYTOPLASMIC (POTENTIAL). 

FT TRANSMEM 69 8 9 2 (POTENTIAL) . 

FT DOMAIN 90 102 EXTRACELLULAR (POTENTIAL). 

FT TRANSMEM 103 124 3 (POTENTIAL). 

FT DOMAIN 125 141 CYTOPLASMIC (POTENTIAL) . 


FT 


I/O 

lOD 

FT 

UKJL y Lt\±iv 

lb/ 

198 

FT 

1 rsjf-UN DiYlHilYl 

1 QQ 

±y y 

218 

FT 

L. X 


z ±y 

235 

FT 

■C X 

TT3 ZiM CJMTTM 

ZOO 

260 

FT 

£ X 

IJUl v l/\±. JN 

Z ol 

277 

FT 


O "~? O 

z / o 

301 

FT 

X X 



352 

FT 

DT STIT.FTF) 

j. u ± 

1 1 Q 
I/O 

FT 

MOD_RES 

3 

3 

FT 

MOD_RES 

10 

10 

FT 

MOD_RES 

14 

14 

FT 

MOD_RES 

15 

15 

SQ 

SEQUENCE 

352 AA; 

404 


4 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

5 (POTENTIAL). 
CYTOPLASMIC (POTENTIAL) . 

6 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL). 

7 (POTENTIAL). 
CYTOPLASMIC (POTENTIAL) . 
BY SIMILARITY. 
SULFATION (BY SIMILARITY) . 
SULFATION (BY SIMILARITY) . 
SULFATION (BY SIMILARITY) . 
SULFATION (BY SIMILARITY) . 

40496 MW; 4366F148C255938F CRC64; 

Query Match 41.3%; Score 365; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 4.9e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7- 


Qy 

20 

Db 

1 

Qy 

50 

Db 

61 

Qy 

84 

Db 

119 

Qy 

98 

Db 

179 

Qy 

125 

Db 

239 


I I I I I : I I I I I I I I I : I I I I I I I 

MDYQVS S PT YDI DYYTSEPCQKVNVKQIAARLLPPLYSLVFI FGFVGNI LWXI LINCKR 60 


-YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

J 1 I I I I I I I I I I I I I : | : : | 


'"" LCTRSQKEGLHYTC 97 

IILLTIDRYLAI VIIAVFAiKARTVTFGW 178 

SSHFPYSQYQFWKNFQTLKI HQRVHGG 12 4 

I I I I I I I I II I I I I I I I I I I :: | 

SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 


-GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : | 


RESULT 9 
CKR5_TRAPH 

ID CKR5_TRAPH STANDARD; PRT; 352 AA 

AC 097879; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) 

GN CCR5 OR CMKBR5. 

OS Trachypithecus phayrei (Phayre's leaf monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalra; Eutheria; Primates; Catarrhini; Cercopithecidae ; Colobinae; 

OC Trachypithecus. 

OX NCBI_TaxID=61618 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=1048 697 0 ; 


RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates 
RL Mol. Biol. Evol. 16:1145-1154(1999). " ' 

CC FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

cc in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
CC the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF075443; AAD19855.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS00237; G_PR0TEIN_RECEP_F1_1 ; 1. 

DR PROSITE; PS502 62; G_PR0TEIN__RECEP_F1_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 

FT DOMAIN 1 30 EXTRACELLULAR (POTENTIAL) 

FT TRANSMEM 31 58 1 (POTENTIAL). 

FT DOMAIN 5 9 68 CYTOPLASMIC (POTENTIAL) . 

FT TRANSMEM 69 89 2 (POTENTIAL). 

FT DOMAIN 90 102 EXTRACELLULAR (POTENTIAL) . 

FT TRANSMEM 103 124 3 (POTENTIAL). 

FT DOMAIN 125 141 CYTOPLASMIC (POTENTIAL) . 

FT TRANSMEM 142 166 4 (POTENTIAL) . 

FT DOMAIN 167 198 EXTRACELLULAR (POTENTIAL) 

FT TRANSMEM 199 218 5 (POTENTIAL). 

FT DOMAIN 219 235 CYTOPLASMIC (POTENTIAL). 

FT TRANSMEM 236 260 6 (POTENTIAL). 

FT DOMAIN 261 277 EXTRACELLULAR (POTENTIAL) 

FT TRANSMEM 278 301 7 (POTENTIAL) . 

FT DOMAIN 302 352 CYTOPLASMIC (POTENTIAL). 

FT DISULFID 101 178 BY SIMILARITY. 

FT MOD_RES 3 3 SULFATION (BY SIMILARITY) . 

FT MOD_RES 10 10 SULFATION (BY SIMILARITY) . 

FT MOD_RES 14 14 SULFATION (BY SIMILARITY) . 

FT MOD_RES 15 15 SULFATION (BY SIMILARITY) . 

SQ SEQUENCE 352 AA; 40509 MW; 4366F14 8D3A5938F CRC64; 

Query Match 41.3%; Score 365; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 4.9e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 7; 

20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I I : I I I I I II I I : I I I I I I I 
Db 1 ^DYQVSSPTYDIDYYTSEPCQKWVKQIAARLLPPLYSLVFIFGFVGNILVVLILINCKR 60 


Qy 


50 


YKCGLC 


AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 


Db 

61 

Qy 

84 

Db 

119 

Qy 

98 

Db 

179 

Qy 

125 

Db 

239 


1 1 


-LCTRSQKEGLHYTC 97 
: I I M : I I I I I I I 


SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I II I I I II I I I I I I I : ; I 


-GGSYKCGLC QEFFGLNNCS S SNRLDGHQRV 154 

I I M II I I I I I I I I I M I : I 


DT 
DT 


RESULT 10 
CKR5_GORGO 

ID CKR5_GORGO STANDARD; PRT; 352 AA. 

AC P56439; 

DT 15-JUL-1998 (Rel. 36, Created) 

15- JUL-1998 (Rel. 36, Last sequence update) 

16- OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Gorilla gorilla gorilla (Lowland gorilla) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Gorilla. 

OX NCBI_TaxID=9595; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97268687; PubMed=910 8 095 ; 

RA Edinger A.L., Amedee A., Miller K. , Doranz B.J., Endres M. , 

RA Sharron M. , Samson M. , Lu Z.H., Clements J.E., Murphey-Corb M. , 

RA Peiper S.C., Parmentier M. , Broder C.C., Doms R.W. ; 

RT "Differential utilization of CCR5 by macrophage and T cell tropic 

RT simian immunodeficiency virus strains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 94:4005-4010(1997). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

cc in the control of granulocytic lineage proliferation or 

CC differentiation . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF005659; AAB62553.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm 1; 1. 


DR 

PRINTS; 

PR00237; 

GPCRRHODOPSN. 

DR 

PROSITE; 

PS00237; 

(j PROTEIN 

_RECEP_F1_1; 1. 

DR 

PROSITE; 

PS50262; 

G PROTEIN 

RECEP Fl 2; 1. 

KW 

G-protein coupled 

receptor; 

Transmembrane; Glycoprotein 
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FT 
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FT 
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FT 

TRANSMEM 
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218 
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FT 

DOMAIN 

219 
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CYTOPLASMIC (POTENTIAL) . 

FT 

1 RANSMEM 

236 

260 

6 (POTENTIAL) . 

FT 

DOMAIN 

261 

277 

EXTRACELLULAR (POTENTIAL) . 

FT 

TRANSMEM 

278 

301 

7 (POTENTIAL) . 

FT 

DOMAIN 

302 

OCT 

CYTOPLASMIC (POTENTIAL) . 

FT 

DISULFID 

101 

178 

BY SIMILARITY. 

FT 

MOD_RES 

3 

3 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

10 

10 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

14 

14 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

15 

15 

SULFATION (BY SIMILARITY) . 

SQ 

SEQUENCE 

352 AA; 

40515 MW 

; D0E6FCB9FE5EAC84 CRC64; 


Query Match 41.2%; Score 364; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 6.1e-28; 

Matches 96; Conservative 8; Mismatches 29; Indels 150; Gaps 7; 


Qy 

20 

Db 

1 

Qy 

50 

Db 

61 

Qy 

84 

Db 

119 

Qy 

98 

Db 

179 

Qy 

125 

Db 

239 


I I I M I I I I II : I | | | | | | | | | | | | | | | 

MDYQVSSPTYDIDYYTSEPCQKTNVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I I I I I I I I I I I : I : : | 


-LCTRSQKEGLHYTC 97 
: I I I I I I I I I I I I 


oSnr i"isyiQETVKNFQTLKI HQRVHGG 124 

I I I II I I I I I I I M I I I I I I : : | 


-GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I I : I 


RESULT 11 
CKR5_MACMU 

ID CKR5_MACMU STANDARD; PRT; 352 AA. 

AC P79436; 002746; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 


DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Macaca mulatta (Rhesus macaque), 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey), and 

OS Macaca nemestrina (Pig-tailed macaque) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9544, 9541, 9545; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC SPECIES=M. mulatta; 

RX MEDLINE=97184592; PubMed=9032394 ; 

RA Marcon L., Choe H., Martin K.A., Farzan M. , Ponath P.D., Wu L., 

RA Newman W. , Gerard N., Gerard C, Sodroski J. ; 

RT "Utilization of C-C chemokine receptor 5 by the envelope 

RT glycoproteins of a pathogenic simian immunodeficiency virus, 

RT SIVmac239."; 

RL J. Virol. 71:2522-2527(1997). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC SPECIES=M. mulatta; STRAIN-Indian macaque; 

RX MEDLINE=97213934; PubMed=9060623 ; 

RA Chen Z., Zhou P., Ho D.D., Landau N.R., Marx P. A.; 

RT "Genetically divergent strains of simian immunodeficiency virus use 

RT CCR5 as a coreceptor for entry."; 

RL J. Virol. 71:2705-2714(1997). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC SPECIES-M. mulatta; 

RX MEDLINE=21354176; PubMed=11461684 ; 

RA Margulies. B.J., Hauer D.A., Clements J.E.; 

RT "Identification and comparison of eleven rhesus macaque chemokine 

RT receptors."; 

RL AIDS Res. Hum. Retroviruses 17:981-986(2001). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC SPECIES=M. mulatta, M. fascicularis , and M. nemestrina; 

RX MEDLINE=97268687; PubMed=9108095 ; 

RA Edinger A.L., Amedee A., Miller K., Doranz B.J., Endres M. , 

RA Sharron M. , Samson M. , Lu Z.H., Clements J.E., Murphey-Corb M. , 

RA Peiper S.C., Parmentier M. , Broder C.C., Doms R.W.; 

RT "Differential utilization of CCR5 by macrophage and T cell tropic 

RT simian immunodeficiency virus strains."; 

RL Proc. Natl. Acad. Sci. U.S.A. 94:4 005-4010(1997). 

CC FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

cc in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 


cc 
cc 
cc 
cc 
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DR 
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DR 
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FT 

FT 

FT 

FT 

FT 

SQ 


use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; U77672; AAC51109.1; 
EMBL; U73739; AAC51158.1; 
EMBL; U96762; AAC34132.1; -. 
EMBL; AF005660; AAB62554.1, 
EMBL; AF005661; AAB62555.li 
EMBL; AF005662; AAB62556.1, 
InterPro; IPR000276; GPCR_Rhodpsn . 
Pfam; PF00001; 7tm_l; 1. 
PRINTS; PR00237; GPCRRHODOPSN . 
PROSITE; PS 002 37; G_PR0TEIN_RECEP_F1_1 ; 1. 
PROSITE; PS 5 02 62; G_PROTEIN_RECEP_Fl_2 ; 1. 

G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 

DOMAIN 1 3 0 EXTRACELLULAR (POTENTIAL) 

TRANSMEM 31 58 1 (POTENTIAL). 

DOMAIN 59 68 CYTOPLASMIC (POTENTIAL) . 

TRANSMEM 69 89 2 (POTENTIAL). 

DOMAIN 9 0 102 EXTRACELLULAR (POTENTIAL) 

TRANSMEM 103 124 3 (POTENTIAL). 

DOMAIN 125 141 CYTOPLASMIC (POTENTIAL) 

TRANSMEM 142 166 4 (POTENTIAL). 

DOMAIN 167 198 EXTRACELLULAR (POTENTIAL) 

TRANSMEM 199 218 5 (POTENTIAL). 

DOMAIN 219 235 CYTOPLASMIC (POTENTIAL) . 

TRANSMEM 236 260 6 (POTENTIAL). 

DOMAIN 261 277 EXTRACELLULAR (POTENTIAL) 

TRANSMEM 278 301 7 (POTENTIAL). 

DOMAIN 302 352 CYTOPLASMIC (POTENTIAL) 

DISULFID 101 178 BY SIMILARITY. 

MOD_RES 3 3 SULFATION (BY SIMILARITY) . 

MOD_RES 10 10 SULFATION (BY SIMILARITY) . 

MOD_RES 14 14 SULFATION (BY SIMILARITY) . 

MOD_RES 15 15 SULFATION (BY SIMILARITY) 

CARBOHYD 268 268 N-LINKED (GLCNAC. . .) (POTENTIAL) 

CONFLICT 241 241 M -> I (IN REF. 3) 

CONFLICT 292 292 I -> M (IN REF. 3). 

SEQUENCE 352 AA; 40507 MW; 58B96C85909FACB2 CRC64; 


Qy 

Db 

Qy 

Db 

Qy 

Db 


Query Match 41.1%; Score 363; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 7.6e-28; 

Matches 95; Conservative 10; Mismatches 28; Indels 150; Gaps 

20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA 49 

I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 

1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 


7; 


50 


-YKCGLC- 


83 


AAAQWDFGNTMCQHQRVHGHHHHSYKCG-- 
1 1 I I I I I I I I I M I I : I : : I 

61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC — QLLTGLYFIGFFSGIFF 118 


84 


LCTRSQKEGLHYTC 97 

1in m t T"\r* : M I I : I I I I I I I 

119 IILLTIDRYLAIVliAVFALKARTVTFGVVTSVITWVVAVFASLPGIIFTRSQREGLHYTC 178 


Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I II I I II II I I I I II : : : | 

Db 179 SSHFPYSQYQFWKNFQTLKMVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I II I I I I I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 12 
CKR5_HYLML 

ID CKR5_HYLML STANDARD; PRT; 352 AA. 

AC Q95NC0; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5). 

GN CCR5 OR CMKBR5. 

OS Hylobates moloch (Silvery gibbon) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hylobatidae; Hylobates 

OX NCBIJTaxID=81572; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=10486970; 

RA Zhang Y.-W., Ryder O.A., Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

cc in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
CC the European Bioinf ormatics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF177899; AAK43382.1; 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm__l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

*ECEP_F1_1; 1. 
*ECEP_F1_2; 1. 

Transmembrane; Glycoprotein; Sulfation. 
EXTRACELLULAR (POTENTIAL) . 

1 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

2 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL). 


DR 

PROSITE; 

PS00237; 

G_PR0TEIN 

DR 

PROSITE; 

PS50262; 

G_PR0TEIN 

KW 

G-protein 

coupled 

receptor; ' 

FT 
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FT 
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FT 
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TRANSMEM 
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FT 
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FT 

TRANSMEM 

199 

218 

FT 

DOMAIN 
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FT 

TRANSMEM 
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260 

FT 

DOMAIN 

261 

277 

FT 

TRANSMEM 

278 

301 

FT 

DOMAIN 
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352 

FT 

DISULFID 

101 

178 

FT 

MOD_RES 

3 

3 

FT 

MOD_RES 

10 

10 

FT 

MOD RES 

14 

14 

SQ 

SEQUENCE 

352 AA; 

40< 


3 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL). 

4 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) . 

5 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

6 (POTENTIAL) . 
EXTRACELLULAR. (POTENTIAL) . 

7 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 
BY SIMILARITY. 
SULFATION (BY SIMILARITY) . 
SULFATION (BY SIMILARITY) . 
SULFATION (BY SIMILARITY) . 

MW; 9623CA98340CF274 CRC64; 

Query Match 41.0%; Score 362; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 9.5e-28; 

Matches 96; Conservative 8; Mismatches 29; Indels 150; Gaps 7; 

QY 2 0 MD YQVS S P I YD I N Y YT S E P C QK I NVKQ I AA 49 

I I M I I I I I I I : I I I I I I I I I I I I I I I I 
Db 1 MDYQVSSPTYDIDYYTSGPCQKINVKQIAARLLPPLYSLVFIFGFVGNMLVILILINCKR 60 

QY 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I I I I II I I I I I I I I : I : : I 

Db 61 LKSMTDIYLLNLAISDLFFLLTVPFWAHYAAAQWDFGNTMC--QLLTGLYFIGFFSGIFF 118 

Qy 8 4 _ LCTRSQKEGLHYTC 97 

: I I I I M I I I I I I 

Db 119 IILLTIDRYLAIVIiAVFALKARTWFGVVTSVITWVVAVFASLPGIIFTRSQKEGLHYTC 178 

QY 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I II II I II I I I I I I I I I I : : I 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Qy 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I II I I I I II I I I I : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 13 
CKR5_CERPY 

ID CKR5_CERPY STANDARD; PRT; 352 AA. 

AC Q9TV42; 

DT 15-MAR-2004 (Rel. 43, Created) 

DT 15-MAR-2004 (Rel. 43, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) . 

GN CCR5 OR CMKBR5. 

OS Cercopithecus pygerythrus (Vervet monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Cercopithecus . 

OX NCBI_TaxID=60710; 
RN [1] 

RP SEQUENCE FROM N.A. 


RT 
RT 
RL 


RC STRAIN=100; 

RX MEDLINE=99335215; PubMed=104 08730 ; 

RA Muller-Trutwin M.C., Corbet S., Hansen J., Georges-Courbot M.-C, 

RA Diop 0., Rigoulet J., Barre-Sinoussi F. , Fomsgaard A. ; 

"Mutations in CCR5-coding sequences are not associated with SIV 
carrier status in African nonhuman primates."; 
AIDS Res. Hum. Retroviruses 15:931-939(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 

CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF035222; AAD44015.1; -. 

DR InterPro; IPR000276; GPCR_Rhodpsn . 

DR Pfam; PF00001; 7tm_l; 1. 

DR PRINTS; PR00237; GPCRRHODOPSN . 

DR PROSITE; PS 00237; G_PR0TEIN_RECEP_F1__1 ; 1. 

DR PROSITE; PS502 62; G_PROTEIN_RECEP_Fl_2 ; 1. 

KW G-protein coupled receptor; Transmembrane; Glycoprotein; Sulfation. 
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FT 
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FT 
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FT 

TRANSMEM 
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260 

6 (POTENTIAL) . 

FT 

DOMAIN 

261 
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EXTRACELLULAR (POTENTIAL). 

FT 

TRANSMEM 

278 

301 

7 (POTENTIAL) . 

FT 

DOMAIN 

302 

352 

CYTOPLASMIC (POTENTIAL) . 

FT 

DISULFID 

101 

178 

BY SIMILARITY. 

FT 

MOD_RES 

3 

3 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

10 

10 

SULFATION (BY SIMILARITY) . 

FT 

MOD_RES 

15 

15 

SULFATION (BY SIMILARITY) . 

SQ 

SEQUENCE 

352 AA; 

40558 

MW; EF17D67CBCCC3DB0 CRC64; 


Query Match 40.8%; Score 361; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 1.2e-27; 

Matches 95; Conservative 9; Mismatches 29; Indels 150; Gaps 7; 


Qy 


20 MDYQVSSPIYDINYYTSEPCQKINVKQIAA- 

MINIM I II : I I I I I I I I I M I I I M I 


49 


Db 1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 

Qy 50 YKCGLC AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

I I . I I I I I II II I I I I : | : : | 

Db 61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 


Qy 84 


LCTRSQKEGLHYTC 97 

n , , , n : I I I I : I I I I I I 

Db 119 II LLTIDRYIAIVHAVFALKARTVTFGVWSVITWWAVFASLPRIIFTRSQREGLHYAC 178 

2y 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I I I I I I I I I I I I I I II I I I : : | 

Db 179 SSHFPYSQYQFWKNFQTLKI VI LGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I I I I I I I I I I I I | : | 

Db 23 9 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 2 81 

RESULT 14 
CKR5_CERTO 

ID CKR5_CERTO STANDARD; PRT; 352 AA. 

AC 062743; 062744; 062745; 062746; 

DT 15-DEC-1998 (Rel. 37, Created) 

DT 15-DEC-1998 (Rel. 37, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) 

GN CCR5 OR CMKBR5. 

OS Cercocebus torquatus atys (Red-crowned mangabey) (Sooty mangabey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Cercocebus. 

OX NCBI_TaxID=9531; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Isolate 079, 085, 087, and 089; 

RX MEDLINE=98321155; PubMed=9656999 ; 

RA Chen Z., Gettie A., Ho D.D., Marx P. A. ; 

RT "Primary SIVsm isolates use the CCR5 coreceptor from sooty mangabeys 

RT naturally infected in west Africa: a comparison of coreceptor usage 

RT of primary SIVsm, HIV-2, and SIVmac."; 

RL Virology 246:113-124(1998). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

cc increasing the intracellular calcium ions level. May play a role 

CC in the control of granulocytic lineage proliferation or 

CC differentiation . 

SUBCELLULAR LOCATION: Integral membrane protein. 
SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 
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CC -!- 


CC 
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entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib . ch) . 


cc 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

DR 

KW 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

FT 

SQ 


EMBL; AF051902; 

EMBL; AF051903; 

EMBL; AF051904; 

EMBL; AF051905; 


AAC39830.1; 
AAC39831.1; 
AAC39832.1; 
AAC39833.1; -, 


InterPro; IPR000276; GPCR_Rhodpsn . 
Pfam; PF00001; 7tm_l; 1. 
PRINTS; PR00237; GPCRRHODOPSN . 
PROSITE; PS00237; G_PROTEIN_RECEP_Fl_l ; 
PROSITE; PS50262; G_PROTEIN RECEP Fl 2; 


1. 

1, 


G-protein coupled receptor; Transmembrane; 


DOMAIN 
TRANSMEM 
DOMAIN 
TRANSMEM 
DOMAIN 
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DOMAIN 
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DOMAIN 
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TRANSMEM 
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DISULFID 
MOD_RES 
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MOD_RES 
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1 
31 
59 
69 
90 
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219 
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278 
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3 
10 
14 
15 
2 
3 
25 
100 
107 
134 
146 
340 
352 AA; 


30 
58 
68 
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40489 MW; 


Glycoprotein; 
(POTENTIAL) . 


EXTRACELLULAR 

1 (POTENTIAL) 
CYTOPLASMIC (POTENTIAL) . 

2 (POTENTIAL). 
EXTRACELLULAR (POTENTIAL) 

3 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL). 

4 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) 

5 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 

6 (POTENTIAL) . 
EXTRACELLULAR (POTENTIAL) 

7 (POTENTIAL) . 
CYTOPLASMIC (POTENTIAL) . 
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SULFATION (BY SIMILARITY) 
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ISOLATE 
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ISOLATE 
ISOLATES 


Sulfation. 
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AND 089) 


ISOLATE 079) 


20A196E2D47E49CA CRC64; 


Query Match 40.8%; Score 361; DB 1; Length 352; 

Best Local Similarity 33.6%; Pred. No. 1.2e-27; 
Matches 95; Conservative 9; Mismatches 


Qy 

Db 

Qy 

Db 

Qy 

Db 


29; Indels 15 0; Gaps 7; 

20 MD YQVS S P I YD I N Y YT SEP CQ KI NVKQ I AA 

I I I I I I I I I I I : I I I I I I I I I I I | | | | | | 
1 MDYQVSSPTYDIDYYTSEPCQKINVKQIAARLLPPLYSLVFIFGFVGNILWLILINCKR 60 


49 


50 


-YKCGLC- 


AAAQWDFGNTMCQHQRVHGHHHHSYKCG 83 

' I I I I I I I I I II I I | : | : : | 

61 LKSMTDIYLLNLAISDLLFLLTVPFWAHYAAAQWDFGNTMC— QLLTGLYFIGFFSGIFF 118 


84 


LCTRSQKEGLHYTC 97 
: I I I I : I I I I I I I 

119 IILLTIDRYLAIVHAVFALKAJ^TVTFGVVTSVITWWAVFASLPGIIFTRSQREGLHYTC 17 8 


Qy 98 SSHFPYSQYQFWKNFQTLKI HQRVHGG 124 

I I M I I I I I I M I I II I I I : : | 

Db 179 SPHFPYSQYQFWKNFQTLKIVILGLVLPLLVMVICYSGILKTLLRCRNEKKRHRAVRLIF 238 

Q ¥ 125 GGSYKCGLC QEFFGLNNCSSSNRLDGHQRV 154 

I I I I I I M I I I I I M I II : I 

Db 239 TIMIVYFLFWAPYNIVLLLNTFQEFFGLNNCSSSNRLDQAMQV 281 


RESULT 15 
CKR5_HYLLE 

ID CKR5_HYLLE STANDARD; PRT; 352 AA. 

AC 097883; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE C-C chemokine receptor type 5 (C-C CKR-5) (CC-CKR-5) (CCR-5) (CCR5) 

GN CCR5 OR CMKBR5 . 

OS Hylobates leucogenys (White-cheeked gibbon) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hylobatidae; Hylobates 

OX NCBI_TaxID=61853; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99416438; PubMed=1048697 0 ; 

RA Zhang Y.-W., Ryder O.A. , Zhang Y.-P.; 

RT "Sequence evolution of the CCR5 chemokine receptor gene in primates."; 

RL Mol. Biol. Evol. 16:1145-1154(1999). 

CC -!- FUNCTION: Receptor for a C-C type chemokine. Binds to MIP-1-alpha, 
CC MIP-1-beta and RANTES and subsequently transduces a signal by 

CC increasing the intracellular calcium ions level. May play a role 

cc in the control of granulocytic lineage proliferation or 

CC differentiation. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

:c -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 
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CC Thxs SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 
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CYTOPLASMIC (POTENTIAL) . 

4 (POTENTIAL). 
EXTRACELLULAR (POTENTIAL) 

5 (POTENTIAL). 
CYTOPLASMIC (POTENTIAL) . 

6 (POTENTIAL) . 
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7 (POTENTIAL) . 
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SULFATION (BY SIMILARITY) 
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4F8E4F344CEB7C91 CRC64; 


Query Match 40.6%; Score 359; DB 1; Length 352; 

Best Local Similarity 33.9%; Pred. No. 1.8e-27; 
Matches 96; Conservative 8; Mismatches 


29; Indels 150; Gaps 7; 
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